mpanda27/voxpopuli_sk_pseudo_labelled
Source: Hugging Face. Updated 2024-11-30; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/mpanda27/voxpopuli_sk_pseudo_labelled
Description:
The dataset provides the following features: audio ID, audio data, normalized text, condition sequence, and Whisper transcript. It is split into training, validation, and test sets containing 4,560, 305, and 265 samples, respectively. The audio is sampled at 16,000 Hz.
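The split sizes and sampling rate in the description can be turned into a quick sanity check. The following is a minimal sketch, not an official loader: the split names `train`, `validation`, and `test` are assumptions (the listing only says training, validation, and test sets), and the helper names are hypothetical. The optional `load_and_check` function assumes the third-party `datasets` package is installed and that the repository is reachable.

```python
# Figures taken from the description; the split names are assumed.
EXPECTED_SPLIT_SIZES = {"train": 4560, "validation": 305, "test": 265}
SAMPLING_RATE_HZ = 16_000

REPO_ID = "mpanda27/voxpopuli_sk_pseudo_labelled"


def split_fractions(sizes):
    """Return each split's share of the total number of samples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def duration_seconds(num_audio_samples, sampling_rate=SAMPLING_RATE_HZ):
    """Convert a waveform length in samples to a duration in seconds."""
    return num_audio_samples / sampling_rate


def load_and_check(repo_id=REPO_ID):
    """Download the dataset and compare its split sizes with the listing.

    Needs the `datasets` package and network access, so the import is
    done lazily and nothing here runs unless the function is called.
    """
    from datasets import load_dataset  # optional third-party dependency

    dataset = load_dataset(repo_id)
    actual = {name: split.num_rows for name, split in dataset.items()}
    return actual == EXPECTED_SPLIT_SIZES, actual


if __name__ == "__main__":
    print(sum(EXPECTED_SPLIT_SIZES.values()))  # 5130
    for name, fraction in split_fractions(EXPECTED_SPLIT_SIZES).items():
        print(f"{name}: {fraction:.1%}")
```

Calling `load_and_check()` returns a flag plus the split sizes actually found, which makes it easy to spot a mismatch if the hosted dataset has changed since this listing was indexed.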
Provider:
mpanda27



