MBZUAI/DuwatBench

Source: Hugging Face. Updated 2026-01-28. Indexed 2026-02-07.
Download link:
https://hf-mirror.com/datasets/MBZUAI/DuwatBench
Description:

DuwatBench is a comprehensive benchmark for evaluating large multimodal models (LMMs) on Arabic calligraphy recognition. Arabic calligraphy is one of the richest visual traditions of the Arabic language, blending linguistic meaning with artistic form. DuwatBench addresses a gap in evaluating how well modern AI systems process stylized Arabic text. The dataset contains 1,272 curated samples spanning 6 classical and modern calligraphic styles, with over 9.5k word instances and approximately 1,475 unique words drawn from religious and cultural domains. It also provides bounding box annotations and full text transcriptions with style and theme labels, and its images retain complex artistic backgrounds that preserve real-world visual complexity.
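
To make the annotation types above concrete, here is a minimal sketch of how one sample might be represented in memory. The class name, field names, box coordinate order, and example values are all assumptions made for illustration; the listing does not give the dataset's actual schema or name its six styles. The last lines derive a rough words-per-sample figure from the counts quoted in the description.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[int, int, int, int]


@dataclass
class CalligraphySample:
    """Hypothetical record layout, not the dataset's real schema."""
    image_path: str
    transcription: str            # full text transcription
    style: str                    # one of the 6 calligraphic styles
    theme: str                    # theme label, e.g. religious or cultural
    boxes: List[Box] = field(default_factory=list)


def word_count(sample: CalligraphySample) -> int:
    """Count whitespace-separated words in the transcription."""
    return len(sample.transcription.split())


def box_area(box: Box) -> int:
    """Area of a box in square pixels; degenerate boxes give 0."""
    x_min, y_min, x_max, y_max = box
    return max(0, x_max - x_min) * max(0, y_max - y_min)


# Placeholder values only; none of these come from the dataset.
sample = CalligraphySample(
    image_path="example.png",
    transcription="بسم الله الرحمن الرحيم",
    style="example-style",
    theme="religious",
    boxes=[(10, 20, 210, 120)],
)

# Figures quoted in the description: 1,272 samples, over 9.5k word instances.
NUM_SAMPLES = 1272
MIN_WORD_INSTANCES = 9500
avg_words_lower_bound = MIN_WORD_INSTANCES / NUM_SAMPLES

print(word_count(sample), box_area(sample.boxes[0]))
print(f"at least {avg_words_lower_bound:.2f} words per sample on average")
```

Since the description says "over 9.5k" word instances, the computed average is a lower bound rather than an exact statistic.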
Provided by:
MBZUAI