AfriQA
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/masakhane-io/afriqa
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为AfriQA,是首个专注于非洲语言的跨语言问答数据集,包含了超过12,000个XOR问答示例,覆盖了10种非洲语言。该数据集特别关注那些数字内容资源较少的语言,并包含了只有在跨语言答案内容中才能找到高覆盖度信息的示例。规模上,该数据集拥有12,000多个示例,其任务是跨语言的开放检索式问答。
The dataset named AfriQA is the first cross-lingual question answering dataset dedicated to African languages. It contains over 12,000 XOR-style question answering examples spanning 10 African languages, with particular emphasis on languages with limited digital resources. It also incorporates examples whose high-coverage information can only be accessed via cross-lingual answer content. With more than 12,000 instances in total, the core task of this dataset is cross-lingual open-retrieval question answering.
提供机构:
Research Team



