Quranic Audio Dataset: Crowdsourced and Labeled Recitation from Non-Arabic Speakers
Source: arXiv. Updated 2024-05-04. Indexed 2024-08-06.
Download link:
http://arxiv.org/abs/2405.02675v1
Resource overview:
The Quranic Audio Dataset was created by Innopolis University to help non-Arabic speakers learn to recite the Quran correctly. It contains approximately 7,000 recitation audio clips from 1,287 participants across 11 non-Arab countries. The audio was collected through a crowdsourcing API integrated into the mobile application NamazApp, and the recordings were annotated on the Quran Voice platform. The dataset is intended primarily for training AI models that help learners identify errors in their recitation and improve its quality, addressing the language barrier that non-Arabic speakers face when learning the Quran.
Provider:
Innopolis University
Created:
2024-05-04



