AdoCleanCode/consolidated_dac_denoising_dataset_GOOD_4_1_batch_1
收藏Hugging Face2025-09-13 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/AdoCleanCode/consolidated_dac_denoising_dataset_GOOD_4_1_batch_1
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含语音数据的 dataset,其中包括说话人ID(utt_id)、语音时长(duration)、噪声类型(noise_type)、原始文件名(original_file)、原始采样率(original_sr)和语音序列(sequence)等信息。数据集分为训练集,共有200,000个样本,总大小为8,948,949,832字节。提供了默认配置,指定了训练集的数据文件路径。
This is a dataset containing speech data, including speaker ID (utt_id), speech duration (duration), noise type (noise_type), original filename (original_file), original sampling rate (original_sr), and speech sequence (sequence). The dataset is split into a training set with a total of 200,000 samples, totaling 8,948,949,832 bytes. A default configuration is provided, specifying the path to the data files for the training set.
提供机构:
AdoCleanCode



