ai-safety-institute/mmlu-translated
Source: Hugging Face. Updated 2026-04-23. Indexed 2026-04-26.
Download link:
https://hf-mirror.com/datasets/ai-safety-institute/mmlu-translated
Description:
MMLU Translated is a partial translation of the MMLU (Massive Multitask Language Understanding) dataset. It provides Spanish translations for selected subjects, including high school mathematics, high school macroeconomics, college mathematics, high school biology, and high school chemistry. The source data is the `test` split of the `cais/mmlu` dataset, and the Spanish translations were generated by Claude. Each subject configuration has two splits: `english` (verbatim from the MMLU `test` split) and `spanish` (translated). Rows are aligned by `id` and share the same `answer` index. The dataset schema has five fields: `id`, `question`, `choices` (list of 4 strings), `answer` (int), and `subject`.
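The `id` alignment described above can be illustrated with a minimal Python sketch. It uses made-up placeholder rows that follow the stated schema, not real dataset content. The helper name `pair_by_id` is hypothetical, and the integer `id` values are an assumption, since the description does not state the type of `id`. With the Hugging Face `datasets` library, the two splits would presumably be loaded as shown in the comment, but the configuration name there is also an assumption.

```python
from typing import Dict, Iterable, List, Tuple

Row = Dict[str, object]

# Assumed loading pattern (not executed here; the config name is a guess):
#   from datasets import load_dataset
#   repo = "ai-safety-institute/mmlu-translated"
#   english = load_dataset(repo, "high_school_mathematics", split="english")
#   spanish = load_dataset(repo, "high_school_mathematics", split="spanish")


def pair_by_id(english: Iterable[Row], spanish: Iterable[Row]) -> List[Tuple[Row, Row]]:
    """Pair rows that share an `id` and check that `answer` matches."""
    spanish_by_id = {row["id"]: row for row in spanish}
    pairs: List[Tuple[Row, Row]] = []
    for en in english:
        es = spanish_by_id.get(en["id"])
        if es is None:
            continue  # no translated row for this id
        if en["answer"] != es["answer"]:
            raise ValueError(f"answer index mismatch for id {en['id']!r}")
        pairs.append((en, es))
    return pairs


# Made-up placeholder rows in the documented schema (not taken from the dataset).
english_rows = [
    {"id": 0, "question": "What is 2 + 2?", "choices": ["3", "4", "5", "6"],
     "answer": 1, "subject": "high_school_mathematics"},
    {"id": 1, "question": "What is 3 * 3?", "choices": ["6", "9", "12", "27"],
     "answer": 1, "subject": "high_school_mathematics"},
]
spanish_rows = [
    {"id": 1, "question": "¿Cuánto es 3 * 3?", "choices": ["6", "9", "12", "27"],
     "answer": 1, "subject": "high_school_mathematics"},
    {"id": 0, "question": "¿Cuánto es 2 + 2?", "choices": ["3", "4", "5", "6"],
     "answer": 1, "subject": "high_school_mathematics"},
]

pairs = pair_by_id(english_rows, spanish_rows)
for en, es in pairs:
    print(en["id"], en["question"], "|", es["question"])
```

Keying on `id` rather than on row position keeps the pairing correct even if the two splits are stored in different orders, and the `answer` check gives an early warning if a pair is misaligned.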
Provided by:
ai-safety-institute



