mpanda27/voxpopuli_sl_pseudo_labelled
收藏Hugging Face2024-12-01 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/mpanda27/voxpopuli_sl_pseudo_labelled
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频和文本数据,具体特征包括音频ID、音频文件、归一化文本、条件序列和Whisper转录文本。数据集分为训练集、验证集和测试集,训练集包含938个样本,验证集包含432个样本,测试集包含127个样本。音频文件的采样率为16000Hz。
This dataset contains audio and text data, with specific features including audio ID, audio files, normalized text, condition sequences, and Whisper transcripts. The dataset is divided into training, validation, and test sets, with the training set containing 938 samples, the validation set containing 432 samples, and the test set containing 127 samples. The audio files have a sampling rate of 16000Hz.
提供机构:
mpanda27



