
RobotsMali/jeli-asr

Source: Hugging Face. Updated 2025-01-18. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/RobotsMali/jeli-asr
Description:
The Jeli-ASR Audio Dataset is a multilingual dataset converted to the Arrow format for fast access and compatibility with modern data workflows. It contains Bambara audio samples with semi-expert transcriptions and French translations. The dataset is organized into three configurations (`jeli-asr-rmai`, `bam-asr-oza`, and `jeli-asr`), each split into training and test sets. It is intended for automatic speech recognition (ASR), text-to-speech synthesis (TTS), and translation tasks. The data was recorded in Mali with griots, then transcribed and translated into French.
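
The configuration and split layout described above could be accessed roughly as follows. This is a minimal sketch, not the publisher's documented usage: it assumes the Hugging Face `datasets` library is installed, that the split names are `train` and `test` (the description only says "training and testing sets"), and that the repository ID matches the listing title. The helper names `load_args` and `load_subset` are hypothetical.

```python
# Sketch: select one configuration and split of the dataset.
DATASET_ID = "RobotsMali/jeli-asr"  # repository ID taken from the listing title
CONFIGS = ("jeli-asr-rmai", "bam-asr-oza", "jeli-asr")  # named in the description
SPLITS = ("train", "test")  # ASSUMPTION: exact split names are not stated


def load_args(config: str, split: str) -> dict:
    """Validate the choice and build keyword arguments for load_dataset."""
    if config not in CONFIGS:
        raise ValueError(f"unknown configuration: {config!r}")
    if split not in SPLITS:
        raise ValueError(f"unknown split: {split!r}")
    return {"path": DATASET_ID, "name": config, "split": split}


def load_subset(config: str, split: str, streaming: bool = True):
    """Load one subset; streaming avoids downloading every audio file up front."""
    # Imported lazily so the validation helper works without the library.
    from datasets import load_dataset

    return load_dataset(**load_args(config, split), streaming=streaming)
```

For example, `load_subset("jeli-asr-rmai", "test")` would return an iterable over the test examples of that configuration, provided the assumed split name is correct. The column names are not given in the description, so inspect the first example before relying on any particular field.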
Provider:
RobotsMali