tuanmanh28/control_dataset
收藏Hugging Face2024-01-31 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/tuanmanh28/control_dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
features:
- name: audio
dtype: audio
- name: file
dtype: string
- name: text
dtype: string
- name: speaker_id
dtype: string
splits:
- name: clean_train
num_bytes: 343258414.1637759
num_examples: 2231
- name: clean_val
num_bytes: 81030474.8752241
num_examples: 558
- name: noise_train
num_bytes: 235319051.21226862
num_examples: 1929
- name: noise_val
num_bytes: 58098436.50373134
num_examples: 483
- name: noise_test
num_bytes: 68465351.0
num_examples: 634
- name: clean_test
num_bytes: 99477037.0
num_examples: 747
download_size: 920722792
dataset_size: 885648764.755
configs:
- config_name: default
data_files:
- split: clean_train
path: data/clean_train-*
- split: clean_val
path: data/clean_val-*
- split: noise_train
path: data/noise_train-*
- split: noise_val
path: data/noise_val-*
- split: clean_test
path: data/clean_test-*
- split: noise_test
path: data/noise_test-*
---
提供机构:
tuanmanh28
原始信息汇总
数据集概述
特征信息
- 音频:数据类型为音频
- 文件:数据类型为字符串
- 文本:数据类型为字符串
- 说话者ID:数据类型为字符串
数据分割
- clean_train:字节数为343258414.1637759,样本数为2231
- clean_val:字节数为81030474.8752241,样本数为558
- noise_train:字节数为235319051.21226862,样本数为1929
- noise_val:字节数为58098436.50373134,样本数为483
- noise_test:字节数为68465351.0,样本数为634
- clean_test:字节数为99477037.0,样本数为747
数据集大小
- 下载大小:920722792字节
- 数据集大小:885648764.755字节
配置信息
- 默认配置:
- clean_train:路径为
data/clean_train-* - clean_val:路径为
data/clean_val-* - noise_train:路径为
data/noise_train-* - noise_val:路径为
data/noise_val-* - clean_test:路径为
data/clean_test-* - noise_test:路径为
data/noise_test-*
- clean_train:路径为



