five

BrunoHays/darija-speech-to-text

收藏
Hugging Face2024-12-19 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/BrunoHays/darija-speech-to-text
下载链接
链接失效反馈
官方服务:
资源简介:
数据集名为Speech To Text Darija dataset,是一个自动语音识别(ASR)任务的数据集。数据集包含音频文件和对应的文本转录,音频文件的采样率为16000Hz。数据集分为训练集和验证集,训练集包含4336个样本,验证集包含1065个样本。每个样本包含音频、文本、开始时间、结束时间和音频ID等信息。数据集的下载大小为4928036729字节,总大小为4953772276.875字节。数据集是adiren7/darija_speech_to_text的重新上传版本。

The dataset is named Speech To Text Darija dataset and is designed for Automatic Speech Recognition (ASR) tasks. It includes audio files and their corresponding text transcriptions, with audio files sampled at 16000Hz. The dataset is divided into training and validation sets, containing 4336 and 1065 samples respectively. Each sample includes audio, text, start time, end time, and audio ID. The datasets download size is 4928036729 bytes, with a total size of 4953772276.875 bytes. This dataset is a reupload of adiren7/darija_speech_to_text.
提供机构:
BrunoHays
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作