Abdo-Alshoki/Tashkeel-Corpus
收藏Hugging Face2024-12-29 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Abdo-Alshoki/Tashkeel-Corpus
下载链接
链接失效反馈官方服务:
资源简介:
Tashkeel-Corpus是一个现代标准阿拉伯语文本数据集,其中的文本带有丰富的Tashkeel(标点符号)。这个数据集为标点化、语音合成、语音识别和阿拉伯语语言建模等任务提供了宝贵的资源。
The Tashkeel-Corpus is a dataset of Modern Standard Arabic (MSA) texts enriched with diacritics (Tashkeel). This corpus serves as a valuable resource for tasks such as diacritization, speech synthesis, speech recognition, and Arabic language modeling.
提供机构:
Abdo-Alshoki



