VladS159/common_voice_17_0_without_synthetic_data
Source: Hugging Face. Updated: 2025-12-13. Indexed: 2025-12-20.
Download link:
https://hf-mirror.com/datasets/VladS159/common_voice_17_0_without_synthetic_data
Summary:
This dataset pairs audio recordings with their corresponding text sentences. Audio is sampled at 48,000 Hz (48 kHz). Each sample contains an audio file, the matching sentence, and a boolean label indicating whether the sample is synthetic. The data is split into a training set (35,289 samples) and a test set (4,432 samples). The download is approximately 2.38 GB, and the full dataset is approximately 2.4 GB.
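The record layout and split sizes described in the summary can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the field names `audio`, `sentence`, and `is_synthetic`, along with the helpers `split_fractions` and `keep_natural`, are hypothetical and are not taken from the dataset itself. Only the sample rate and the split counts come from the summary.

```python
from __future__ import annotations

from dataclasses import dataclass

SAMPLE_RATE_HZ = 48_000  # stated in the summary
SPLIT_SIZES = {"train": 35_289, "test": 4_432}  # stated in the summary


@dataclass
class Sample:
    # Hypothetical field names; the summary describes the fields only informally.
    audio: list[float]   # decoded waveform, assumed to be at SAMPLE_RATE_HZ
    sentence: str        # text paired with the audio
    is_synthetic: bool   # boolean label marking synthetic data

    def duration_seconds(self) -> float:
        """Length of the clip, derived from the number of waveform values."""
        return len(self.audio) / SAMPLE_RATE_HZ


def split_fractions(sizes: dict[str, int]) -> dict[str, float]:
    """Return each split's share of the total sample count."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def keep_natural(samples: list[Sample]) -> list[Sample]:
    """Keep only the samples whose synthetic flag is False."""
    return [s for s in samples if not s.is_synthetic]


if __name__ == "__main__":
    print(sum(SPLIT_SIZES.values()))     # total number of samples
    print(split_fractions(SPLIT_SIZES))  # share of train and test
```

A one-second clip at this sample rate holds 48,000 waveform values, which is what `duration_seconds` relies on.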
Provider:
VladS159



