MASAQ: Morphologically-Analyzed and Syntactically-Annotated Quran Dataset
Source: Mendeley Data (indexed 2026-04-09)
Download link:
https://data.mendeley.com/datasets/9yvrzxktmr
Description:
The Morphologically-Analyzed and Syntactically-Annotated Quran (MASAQ) dataset is an annotated resource designed to advance Arabic Natural Language Processing (NLP). It covers the entire Quran and includes over 131K morphological and 123K syntactic entries, verified by expert linguists using traditional i'rab methodologies. The dataset is available in multiple formats and supports a range of applications, from teaching Arabic grammar to improving NLP tools such as parsers and taggers. By enabling precise language analysis, MASAQ is intended to support progress in Arabic NLP and cross-linguistic research. It is released under a Creative Commons license.
Provided by:
The University of Jordan; Amazon.com Inc



