crobinnn/common_voice_17_0_id_pseudo_labelled
收藏Hugging Face2024-12-13 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/crobinnn/common_voice_17_0_id_pseudo_labelled
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频和文本数据,特征包括路径、音频、句子、条件序列和Whisper转录文本。数据集分为训练集、验证集和测试集,训练集包含1033个样本,验证集包含507个样本,测试集包含544个样本。数据集的下载大小为1688454783字节,总大小为1812089045.776字节。
This dataset contains audio files along with their corresponding text transcripts generated using the Whisper model. The dataset is divided into train, validation, and test sets, each containing audio files and their transcriptions. The audio files have a sampling rate of 16000Hz.
提供机构:
crobinnn



