VladS159/common_voice_17_0_without_synthetic_data

Name: VladS159/common_voice_17_0_without_synthetic_data
Creator: VladS159
Published: 2025-12-13 10:16:44
License: not specified

Source: Hugging Face
Updated: 2025-12-13
Indexed: 2025-12-20

Download link:

https://hf-mirror.com/datasets/VladS159/common_voice_17_0_without_synthetic_data

Description:

This dataset pairs audio recordings with their corresponding text sentences. The audio is sampled at 48 kHz (48,000 Hz). Each sample contains an audio file, the matching text sentence, and a boolean label indicating whether the sample is synthetic. The dataset is split into a training set (35,289 samples) and a test set (4,432 samples). The download size is approximately 2.38 GB, and the full dataset size is approximately 2.4 GB.
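As a rough illustration of how the splits described above might be accessed, here is a minimal Python sketch. It assumes the third-party Hugging Face `datasets` package is installed and that the repository exposes splits named `train` and `test`; those split names, and the helpers `split_summary` and `open_split`, are assumptions made for this example and are not confirmed by the listing. Column names are not stated in the listing, so the sketch does not reference any.

```python
# Repository id and split sizes are taken from the listing above.
REPO_ID = "VladS159/common_voice_17_0_without_synthetic_data"
SPLIT_SIZES = {"train": 35_289, "test": 4_432}


def split_summary(sizes):
    """Return each split's share of the total sample count, rounded to 4 places."""
    total = sum(sizes.values())
    return {name: round(count / total, 4) for name, count in sizes.items()}


def open_split(split="train"):
    """Open one split in streaming mode (hypothetical helper).

    Streaming avoids fetching the full ~2.38 GB download up front.
    The split names are an assumption; check the dataset card to confirm them.
    """
    if split not in SPLIT_SIZES:
        raise ValueError(f"unknown split: {split!r}")
    # Imported lazily so the helpers above work without the package installed.
    from datasets import load_dataset
    return load_dataset(REPO_ID, split=split, streaming=True)
```

Calling `open_split("train")` needs network access, and decoding the audio may require an additional audio backend depending on the installed `datasets` version. Inspecting the returned object's `features` attribute is one way to discover the actual column names before relying on them.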

Provider:

VladS159
