mpanda27/voxpopuli_sl_pseudo_labelled

Name: mpanda27/voxpopuli_sl_pseudo_labelled
Creator: mpanda27
Published: 2024-12-01 01:30:27
License: 暂无描述

Hugging Face2024-12-01 更新2024-12-14 收录

下载链接：

https://hf-mirror.com/datasets/mpanda27/voxpopuli_sl_pseudo_labelled

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含音频和文本数据，具体特征包括音频ID、音频文件、归一化文本、条件序列和Whisper转录文本。数据集分为训练集、验证集和测试集，训练集包含938个样本，验证集包含432个样本，测试集包含127个样本。音频文件的采样率为16000Hz。

This dataset contains audio and text data, with specific features including audio ID, audio files, normalized text, condition sequences, and Whisper transcripts. The dataset is divided into training, validation, and test sets, with the training set containing 938 samples, the validation set containing 432 samples, and the test set containing 127 samples. The audio files have a sampling rate of 16000Hz.

提供机构：

mpanda27

5,000+

优质数据集

54 个

任务类型

进入经典数据集