ebayes/uhura-arc-easy
Source: Hugging Face. Updated: 2024-07-22. Indexed: 2024-06-29.
Download link:
https://hf-mirror.com/datasets/ebayes/uhura-arc-easy
Resource overview:
Uhura-Arc-Easy is a multilingual multiple-choice question dataset with versions in Amharic, English, Hausa, and Yoruba, intended for training and evaluating natural language processing models on multiple-choice question tasks. It contains between 1K and 10K examples and is divided into training, test, and validation splits.
Provided by:
ebayes
Summary of original information
Dataset card
Dataset details
Dataset description
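- License: cc-by-nc-4.0
- Languages:
  - Amharic (am)
  - English (en)
  - Hausa (ha)
  - Yoruba (yo)
- Size category: 1K<n<10K
- Multilinguality: multilingual
- Dataset name: Uhura-Arc-Easy
- Tags:
  - uhura
  - arc-easy
  - arc
- Task categories:
  - Multiple choice
- Task IDs:
  - Multiple-choice question answering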
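Since the card lists multiple-choice question answering as the task, a minimal sketch of how predictions might be scored is given here. The function name and the letter-style answer labels are assumptions for illustration; the card does not describe the record schema or the label format.

```python
from typing import Sequence


def multiple_choice_accuracy(predictions: Sequence[str], answers: Sequence[str]) -> float:
    """Return the fraction of predicted labels that match the gold labels.

    Labels are compared case-insensitively after stripping whitespace.
    The label format (e.g. "A".."D") is an assumption, not taken from the card.
    """
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("cannot compute accuracy on an empty evaluation set")
    correct = sum(
        p.strip().upper() == a.strip().upper()
        for p, a in zip(predictions, answers)
    )
    return correct / len(answers)


if __name__ == "__main__":
    # Hypothetical model outputs and gold labels, for illustration only.
    preds = ["A", "c", "B", "D"]
    gold = ["A", "C", "D", "D"]
    print(multiple_choice_accuracy(preds, gold))  # 3 of 4 match -> 0.75
```

Plain accuracy is the usual headline metric for this kind of task, but the card itself does not prescribe one.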
Dataset structure
Configurations
- config_name: am_multiple_choice
  - data_files:
    - split: train
      path: am_train.json
    - split: test
      path: am_test.json
    - split: validation
      path: am_dev.json
- config_name: en_multiple_choice
  - data_files:
    - split: train
      path: en_train.json
    - split: test
      path: en_test.json
    - split: validation
      path: en_dev.json
- config_name: ha_multiple_choice
  - data_files:
    - split: train
      path: ha_train.json
    - split: test
      path: ha_test.json
    - split: validation
      path: ha_dev.json
- config_name: sw_multiple_choice
  - data_files:
    - split: train
      path: sw_train.json
    - split: test
      path: sw_test.json
    - split: validation
      path: sw_dev.json
- config_name: yo_multiple_choice
  - data_files:
    - split: train
      path: yo_train.json
    - split: test
      path: yo_test.json
    - split: validation
      path: yo_dev.json
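The configurations above follow a regular naming pattern, which the sketch below encodes: each config is `<lang>_multiple_choice`, and the validation split is stored in a `*_dev.json` file. The helper names are hypothetical. The `load_config` function assumes the Hugging Face `datasets` library is installed and that the repository is reachable under the ID `ebayes/uhura-arc-easy`. The configs include `sw`, although the language list on the card names only four languages.

```python
# Language codes taken from the config names listed in the card.
LANGUAGES = ["am", "en", "ha", "sw", "yo"]

# Split name -> file-name suffix; note that "validation" maps to "*_dev.json".
SPLIT_SUFFIXES = {"train": "train", "test": "test", "validation": "dev"}


def config_name(lang: str) -> str:
    """Build the config name for a language code, e.g. 'am' -> 'am_multiple_choice'."""
    if lang not in LANGUAGES:
        raise ValueError(f"unknown language code: {lang!r}")
    return f"{lang}_multiple_choice"


def data_files(lang: str) -> dict:
    """Return the split -> file name mapping for one language."""
    if lang not in LANGUAGES:
        raise ValueError(f"unknown language code: {lang!r}")
    return {split: f"{lang}_{suffix}.json" for split, suffix in SPLIT_SUFFIXES.items()}


def load_config(lang: str, split: str = "test"):
    """Load one split of one config (assumes `datasets` is installed and network access)."""
    from datasets import load_dataset  # imported lazily so the helpers above work offline

    return load_dataset("ebayes/uhura-arc-easy", config_name(lang), split=split)


if __name__ == "__main__":
    for lang in LANGUAGES:
        print(config_name(lang), data_files(lang))
```

For example, `load_config("ha", "validation")` would request the Hausa validation split, which the card maps to `ha_dev.json`.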



