SeaEval/cross_mmlu

Name: SeaEval/cross_mmlu
Creator: SeaEval
Published: 2024-07-18 05:44:07
License: 暂无描述

Hugging Face2024-07-18 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/SeaEval/cross_mmlu

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多种语言（如英语、中文、西班牙语、越南语、印尼语、马来语和菲律宾语）的问答数据。每个语言部分包含答案、选项和问题三个字段。数据集分为测试集，包含150个示例，总大小为345556字节。

The dataset contains questions and answers in multiple languages, each with a question, choices, and answer section. Languages include English, Chinese, Spanish, Vietnamese, Indonesian, Malay, and Filipino. The dataset is divided into a test set with 150 samples, with a total download size of 244592 bytes and a dataset size of 345556 bytes.

提供机构：

SeaEval

原始信息汇总

数据集概述

数据集结构

id: 字符串类型
English:
- answer: 字符串类型
- choices: 字符串序列
- question: 字符串类型
Chinese:
- answer: 字符串类型
- choices: 字符串序列
- question: 字符串类型
Spanish:
- answer: 字符串类型
- choices: 字符串序列
- question: 字符串类型
Vietnamese:
- answer: 字符串类型
- choices: 字符串序列
- question: 字符串类型
Indonesian:
- answer: 字符串类型
- choices: 字符串序列
- question: 字符串类型
Malay:
- answer: 字符串类型
- choices: 字符串序列
- question: 字符串类型
Filipino:
- answer: 字符串类型
- choices: 字符串序列
- question: 字符串类型

数据集分割

test:
- 样本数量: 150
- 字节数: 345556

数据集大小

下载大小: 244592 字节
数据集大小: 345556 字节

配置

default:
- 数据文件:
  - split: test
  - path: data/test-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集