
galsenai/wolof-audio-data

Source: Hugging Face | Updated: 2024-12-25 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/galsenai/wolof-audio-data
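Description:
The Wolof Audio Dataset contains Wolof audio recordings and their corresponding transcriptions. It is intended to support the development of automatic speech recognition (ASR) models for Wolof. It combines four existing datasets: ALFFA, FLEURS, Urban Bus Wolof Speech Dataset, and Kallama Dataset. It includes audio files and transcription text, with audio sampled at 16 kHz. The dataset is split into training and test sets containing 28,807 and 6,268 examples, respectively.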

The Wolof Audio Dataset is a collection of Wolof audio recordings and their corresponding transcriptions. It is designed to support the development of Automatic Speech Recognition (ASR) models for the Wolof language. It was created by combining four existing datasets: ALFFA, FLEURS, Urban Bus Wolof Speech Dataset, and Kallama Dataset. The dataset includes audio files in varying formats with a sampling rate of 16 kHz, along with their transcriptions and the source of each example. The dataset is divided into training and test splits, with a total of 24,346 examples.
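
As a rough illustration of how the splits described above might be loaded, here is a minimal sketch using the Hugging Face `datasets` library. The repository ID is taken from the download link; the split names (`train`, `test`) and the audio column name (`audio`) are assumptions that have not been checked against the dataset card, and `load_wolof_split` is a hypothetical helper name.

```python
# Minimal sketch, not an official loader for this dataset.
REPO_ID = "galsenai/wolof-audio-data"   # from the download link above
TARGET_SAMPLING_RATE = 16_000           # 16 kHz, as stated in the description


def load_wolof_split(split="train", audio_column="audio"):
    """Load one split and decode its audio at 16 kHz.

    `split` and `audio_column` are assumed names; adjust them to match
    the actual dataset card.
    """
    # Imported here so the module can be inspected without `datasets` installed.
    from datasets import Audio, load_dataset

    ds = load_dataset(REPO_ID, split=split)
    # Only cast if the assumed column exists, so a wrong guess does not crash.
    if audio_column in ds.column_names:
        ds = ds.cast_column(audio_column, Audio(sampling_rate=TARGET_SAMPLING_RATE))
    return ds
```

Calling `load_wolof_split("train")` and `load_wolof_split("test")` would download the data on first use. Checking `len()` of each returned split is one way to see which of the example counts quoted above matches the current release.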
Provider:
galsenai