Buraaq/quran-audio-text-dataset
收藏Hugging Face2025-11-24 更新2025-11-29 收录
下载链接:
https://hf-mirror.com/datasets/Buraaq/quran-audio-text-dataset
下载链接
链接失效反馈官方服务:
资源简介:
Quran-MD数据集是一个包含文本、语言和音频维度的综合多模态数据集,涵盖完整的《古兰经》内容。数据集分为两个独立的数据集:ayah数据集和word数据集。ayah数据集包含30位著名诵读者的完整经文朗诵,而word数据集包含单独的单词发音。数据集提供阿拉伯语、英语和音标转换版本,可用于自然语言处理、语音识别、文本到语音合成和数字伊斯兰研究等应用。README还包含了数据集架构、数据结构格式和数据创建过程的信息,以及如何下载数据集和引用信息。
Quran-MD is a comprehensive multimodal dataset of the Quran with textual, linguistic, and audio dimensions, covering the complete content of the Quran. The dataset is divided into two separate datasets: Ayah Dataset and Word Dataset. The Ayah Dataset contains complete verse recitations from 30 renowned reciters, while the Word Dataset contains individual word pronunciations. The dataset is available in Arabic, English, and transliteration, and can be used for various applications such as natural language processing, speech recognition, text-to-speech synthesis, and digital Islamic studies. The README also includes information on the dataset architecture, data structure format, and data creation process, as well as instructions on how to download the dataset and citation information.
提供机构:
Buraaq



