BrunoHays/darija-speech-to-text
收藏Hugging Face2024-12-19 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/BrunoHays/darija-speech-to-text
下载链接
链接失效反馈官方服务:
资源简介:
数据集名为Speech To Text Darija dataset,是一个自动语音识别(ASR)任务的数据集。数据集包含音频文件和对应的文本转录,音频文件的采样率为16000Hz。数据集分为训练集和验证集,训练集包含4336个样本,验证集包含1065个样本。每个样本包含音频、文本、开始时间、结束时间和音频ID等信息。数据集的下载大小为4928036729字节,总大小为4953772276.875字节。数据集是adiren7/darija_speech_to_text的重新上传版本。
The dataset is named Speech To Text Darija dataset and is designed for Automatic Speech Recognition (ASR) tasks. It includes audio files and their corresponding text transcriptions, with audio files sampled at 16000Hz. The dataset is divided into training and validation sets, containing 4336 and 1065 samples respectively. Each sample includes audio, text, start time, end time, and audio ID. The datasets download size is 4928036729 bytes, with a total size of 4953772276.875 bytes. This dataset is a reupload of adiren7/darija_speech_to_text.
提供机构:
BrunoHays



