five

kotoba-tech/streaming-mbd-experiment

收藏
Hugging Face2024-05-22 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/kotoba-tech/streaming-mbd-experiment
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: ja_asr.jsut_basic5000 features: - name: original dtype: audio - name: reconstructed_2codes dtype: audio - name: reconstructed_3codes dtype: audio - name: reconstructed_3codes.150chunks.120strides.crossfade dtype: audio - name: reconstructed_3codes.150chunks.120strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.150chunks.120strides.first dtype: audio - name: reconstructed_3codes.150chunks.120strides.first.enhancer dtype: audio - name: reconstructed_3codes.150chunks.120strides.last dtype: audio - name: reconstructed_3codes.150chunks.120strides.last.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.crossfade dtype: audio - name: reconstructed_3codes.150chunks.140strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.first dtype: audio - name: reconstructed_3codes.150chunks.140strides.first.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.last dtype: audio - name: reconstructed_3codes.150chunks.140strides.last.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.crossfade dtype: audio - name: reconstructed_3codes.75chunks.55strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.first dtype: audio - name: reconstructed_3codes.75chunks.55strides.first.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.last dtype: audio - name: reconstructed_3codes.75chunks.55strides.last.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.crossfade dtype: audio - name: reconstructed_3codes.75chunks.65strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.first dtype: audio - name: reconstructed_3codes.75chunks.65strides.first.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.last dtype: audio - name: reconstructed_3codes.75chunks.65strides.last.enhancer dtype: audio - name: reconstructed_3codes.enhancer dtype: audio - name: reconstructed_4codes dtype: audio - name: reconstructed_5codes dtype: audio - name: reconstructed_6codes dtype: audio splits: - name: test num_bytes: 210150174.0 num_examples: 16 download_size: 181357972 dataset_size: 210150174.0 - config_name: ja_asr.reazonspeech_test features: - name: original dtype: audio - name: reconstructed_2codes dtype: audio - name: reconstructed_3codes dtype: audio - name: reconstructed_3codes.150chunks.120strides.crossfade dtype: audio - name: reconstructed_3codes.150chunks.120strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.150chunks.120strides.first dtype: audio - name: reconstructed_3codes.150chunks.120strides.first.enhancer dtype: audio - name: reconstructed_3codes.150chunks.120strides.last dtype: audio - name: reconstructed_3codes.150chunks.120strides.last.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.crossfade dtype: audio - name: reconstructed_3codes.150chunks.140strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.first dtype: audio - name: reconstructed_3codes.150chunks.140strides.first.enhancer dtype: audio - name: reconstructed_3codes.150chunks.140strides.last dtype: audio - name: reconstructed_3codes.150chunks.140strides.last.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.crossfade dtype: audio - name: reconstructed_3codes.75chunks.55strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.first dtype: audio - name: reconstructed_3codes.75chunks.55strides.first.enhancer dtype: audio - name: reconstructed_3codes.75chunks.55strides.last dtype: audio - name: reconstructed_3codes.75chunks.55strides.last.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.crossfade dtype: audio - name: reconstructed_3codes.75chunks.65strides.crossfade.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.first dtype: audio - name: reconstructed_3codes.75chunks.65strides.first.enhancer dtype: audio - name: reconstructed_3codes.75chunks.65strides.last dtype: audio - name: reconstructed_3codes.75chunks.65strides.last.enhancer dtype: audio - name: reconstructed_3codes.enhancer dtype: audio - name: reconstructed_4codes dtype: audio - name: reconstructed_5codes dtype: audio - name: reconstructed_6codes dtype: audio splits: - name: test num_bytes: 361331646.0 num_examples: 16 download_size: 274007178 dataset_size: 361331646.0 - config_name: sample_audio features: - name: reconstructed_2codes dtype: audio - name: reconstructed_3codes dtype: audio - name: reconstructed_3codes.enhancer dtype: audio - name: reconstructed_4codes dtype: audio - name: reconstructed_5codes dtype: audio - name: reconstructed_6codes dtype: audio splits: - name: test num_bytes: 4472522.0 num_examples: 3 download_size: 4344170 dataset_size: 4472522.0 configs: - config_name: ja_asr.jsut_basic5000 data_files: - split: test path: ja_asr.jsut_basic5000/test-* - config_name: ja_asr.reazonspeech_test data_files: - split: test path: ja_asr.reazonspeech_test/test-* - config_name: sample_audio data_files: - split: test path: sample_audio/test-* ---
提供机构:
kotoba-tech
原始信息汇总

数据集概述

数据集1: ja_asr.jsut_basic5000

  • 配置名称: ja_asr.jsut_basic5000
  • 特征:
    • original: 音频
    • reconstructed_2codes: 音频
    • reconstructed_3codes: 音频
    • reconstructed_3codes.150chunks.120strides.crossfade: 音频
    • reconstructed_3codes.150chunks.120strides.crossfade.enhancer: 音频
    • reconstructed_3codes.150chunks.120strides.first: 音频
    • reconstructed_3codes.150chunks.120strides.first.enhancer: 音频
    • reconstructed_3codes.150chunks.120strides.last: 音频
    • reconstructed_3codes.150chunks.120strides.last.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.crossfade: 音频
    • reconstructed_3codes.150chunks.140strides.crossfade.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.first: 音频
    • reconstructed_3codes.150chunks.140strides.first.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.last: 音频
    • reconstructed_3codes.150chunks.140strides.last.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.crossfade: 音频
    • reconstructed_3codes.75chunks.55strides.crossfade.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.first: 音频
    • reconstructed_3codes.75chunks.55strides.first.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.last: 音频
    • reconstructed_3codes.75chunks.55strides.last.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.crossfade: 音频
    • reconstructed_3codes.75chunks.65strides.crossfade.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.first: 音频
    • reconstructed_3codes.75chunks.65strides.first.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.last: 音频
    • reconstructed_3codes.75chunks.65strides.last.enhancer: 音频
    • reconstructed_3codes.enhancer: 音频
    • reconstructed_4codes: 音频
    • reconstructed_5codes: 音频
    • reconstructed_6codes: 音频
  • 分割:
    • test: 16个样本,数据大小210150174.0字节
  • 下载大小: 181357972字节
  • 数据集大小: 210150174.0字节

数据集2: ja_asr.reazonspeech_test

  • 配置名称: ja_asr.reazonspeech_test
  • 特征:
    • original: 音频
    • reconstructed_2codes: 音频
    • reconstructed_3codes: 音频
    • reconstructed_3codes.150chunks.120strides.crossfade: 音频
    • reconstructed_3codes.150chunks.120strides.crossfade.enhancer: 音频
    • reconstructed_3codes.150chunks.120strides.first: 音频
    • reconstructed_3codes.150chunks.120strides.first.enhancer: 音频
    • reconstructed_3codes.150chunks.120strides.last: 音频
    • reconstructed_3codes.150chunks.120strides.last.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.crossfade: 音频
    • reconstructed_3codes.150chunks.140strides.crossfade.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.first: 音频
    • reconstructed_3codes.150chunks.140strides.first.enhancer: 音频
    • reconstructed_3codes.150chunks.140strides.last: 音频
    • reconstructed_3codes.150chunks.140strides.last.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.crossfade: 音频
    • reconstructed_3codes.75chunks.55strides.crossfade.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.first: 音频
    • reconstructed_3codes.75chunks.55strides.first.enhancer: 音频
    • reconstructed_3codes.75chunks.55strides.last: 音频
    • reconstructed_3codes.75chunks.55strides.last.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.crossfade: 音频
    • reconstructed_3codes.75chunks.65strides.crossfade.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.first: 音频
    • reconstructed_3codes.75chunks.65strides.first.enhancer: 音频
    • reconstructed_3codes.75chunks.65strides.last: 音频
    • reconstructed_3codes.75chunks.65strides.last.enhancer: 音频
    • reconstructed_3codes.enhancer: 音频
    • reconstructed_4codes: 音频
    • reconstructed_5codes: 音频
    • reconstructed_6codes: 音频
  • 分割:
    • test: 16个样本,数据大小361331646.0字节
  • 下载大小: 274007178字节
  • 数据集大小: 361331646.0字节

数据集3: sample_audio

  • 配置名称: sample_audio
  • 特征:
    • reconstructed_2codes: 音频
    • reconstructed_3codes: 音频
    • reconstructed_3codes.enhancer: 音频
    • reconstructed_4codes: 音频
    • reconstructed_5codes: 音频
    • reconstructed_6codes: 音频
  • 分割:
    • test: 3个样本,数据大小4472522.0字节
  • 下载大小: 4344170字节
  • 数据集大小: 4472522.0字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作