rabah2026/Quran-Ayah-Corpus
收藏Hugging Face2025-09-19 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/rabah2026/Quran-Ayah-Corpus
下载链接
链接失效反馈官方服务:
资源简介:
Quran-Ayah-Corpus是一个包含高质量《古兰经》经文(Ayahs)音频记录和对应精确转录的阿拉伯语语音数据集,适用于自动语音识别(ASR)任务。数据集采用16kHz采样率,包含训练集、验证集和测试集,朗诵者分配严格,以确保模型泛化能力。每个数据实例包含音频、持续时间、文本和朗诵者信息。
Quran-Ayah-Corpus is an Arabic speech dataset containing high-quality audio recordings of Quranic verses (Ayahs) along with their precise transcriptions, designed for Automatic Speech Recognition (ASR) tasks. The dataset is standardized at a 16kHz sampling rate and includes train, validation, and test splits with strict reciter assignment for model generalization. Each data instance comprises audio, duration, text, and reciter information.
提供机构:
rabah2026



