HeshamHaroon/ArabicRAGB
收藏Hugging Face2025-12-15 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/HeshamHaroon/ArabicRAGB
下载链接
链接失效反馈官方服务:
资源简介:
ArabicRAGB是一个用于评估阿拉伯语任务中检索增强生成(RAG)系统的基准数据集。每条记录包含一个查询-段落对,其中查询基于段落内容生成并可从中找到答案。关键特征包括:基于段落的查询、多方言覆盖(现代标准阿拉伯语、埃及阿拉伯语、海湾阿拉伯语、黎凡特阿拉伯语和马格里布阿拉伯语)、复杂度级别(简单、中等、复杂和多跳查询)以及统一格式(每条记录包含查询和段落)。数据集包含13,163条记录,涵盖地理、历史、文化、科学、健康、法律和经济等多个主题。
ArabicRAGB is a benchmark dataset for evaluating Retrieval-Augmented Generation (RAG) systems on Arabic language tasks. Each record contains a query-passage pair where the query is grounded in the passage content. Key features include: Passage-Grounded Queries, Multi-Dialect Coverage (MSA, Egyptian, Gulf, Levantine, and Maghrebi Arabic), Complexity Levels (Simple, Moderate, Complex, and Multi-hop queries), and Unified Format (Query and passage together in each record). The dataset contains 13,163 records covering diverse topics such as Geography, History, Culture, Science, Health, Law, and Economy.
提供机构:
HeshamHaroon



