mpanda27/common_voice_16_0_hu_pseudo_labelled

Name: mpanda27/common_voice_16_0_hu_pseudo_labelled
Creator: mpanda27
Published: 2024-11-30 00:23:46
License: 暂无描述

Hugging Face2024-11-30 更新2024-12-14 收录

下载链接：

https://hf-mirror.com/datasets/mpanda27/common_voice_16_0_hu_pseudo_labelled

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含音频和文本数据，主要用于语音识别或相关任务。数据集的特征包括音频路径、音频数据（采样率为16000Hz）、句子文本、条件序列和Whisper转录文本。数据集分为训练集、验证集和测试集，分别包含6976、2240和2366个样本。

This dataset contains audio and text data, primarily used for speech recognition or related tasks. The features of the dataset include audio paths, audio data (with a sampling rate of 16000Hz), sentence text, condition sequences, and Whisper transcriptions. The dataset is divided into training, validation, and test sets, containing 6976, 2240, and 2366 samples respectively.

提供机构：

mpanda27

5,000+

优质数据集

54 个

任务类型

进入经典数据集