five

NaijaRC

收藏
arXiv2024-05-19 更新2024-06-21 收录
下载链接:
https://github.com/AremuAdeolaJr/NaijaRC
下载链接
链接失效反馈
官方服务:
资源简介:
NaijaRC是一个专为尼日利亚语言设计的多项选择阅读理解数据集,涵盖了Hausa、Igbo和Yorùbá三种语言。该数据集由Masakhane NLP创建,基于高中阅读理解考试题目,旨在提升机器在理解长段落和回答问题方面的性能。数据集内容包括阅读理解段落、问题及答案,均由母语者和语言学家仔细验证和清理。NaijaRC的应用领域主要集中在提升非洲语言的机器阅读理解能力,解决资源匮乏语言的AI模型开发挑战。

NaijaRC is a multiple-choice reading comprehension dataset tailored for Nigerian languages, encompassing Hausa, Igbo, and Yorùbá. It was developed by Masakhane NLP and sourced from high school reading comprehension examination questions, with the primary objective of advancing machine performance in long passage comprehension and question answering. The dataset comprises reading comprehension passages, questions, and corresponding answers, all of which have been rigorously verified and curated by native speakers and linguists. The core application scope of NaijaRC lies in improving machine reading comprehension capabilities for African languages, and addressing the development hurdles of AI models for low-resource languages.
提供机构:
Masakhane NLP
创建时间:
2023-08-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作