ceval

Name: ceval
Creator: maas
Published: 2026-07-15 02:02:01
License: 暂无描述

魔搭社区2026-07-15 更新2025-08-16 收录

下载链接：

https://modelscope.cn/datasets/vllm-ascend/ceval

下载链接

链接失效反馈

官方服务：

资源简介：

数据集文件元信息以及数据文件，请浏览“数据集文件”页面获取。当前数据集卡片使用的是默认模版，数据集的贡献者未提供更加详细的数据集介绍，但是您可以通过如下GIT Clone命令，或者ModelScope SDK来下载数据集 #### 下载方法 :modelscope-code[]{type="sdk"} :modelscope-code[]{type="git"}

# C-Eval (Test-Split) in Unified JSONL Format ## Dataset Description This project converts the `test` split of the [C-Eval dataset](https://huggingface.co/datasets/ceval/ceval-exam) into a **unified instruction-style JSONL format** to facilitate the evaluation and testing of Large Language Models (LLMs). C-Eval is a comprehensive Chinese foundational model evaluation suite designed to measure the capabilities of language models in Chinese language and knowledge. The data in this repository is sourced from the `test` split of the original `ceval` dataset, and adopts exactly the same processing pipeline and data structure as the CMMLU dataset, enabling users to conduct evaluations under a unified framework. ## Data Format The dataset is in JSONL format, where each line is a standalone JSON object. This structure is carefully designed to fit standard instruction tuning and inference workflows. Each JSON object contains the following fields: * `id`: Unique identifier of the sample. * `instruction`: Instruction text that guides the model to answer the multiple-choice question. * `choices`: A **dictionary** containing four options, with keys "A", "B", "C", and "D". * `answer`: The correct answer to the question ('A', 'B', 'C', or 'D'). **Format Example:** json { "id": "1", "instruction": "问题: 中国的首都是哪里？请从以下选项中选择一个正确答案。", "choices": { "A": "上海", "B": "北京", "C": "广州", "D": "深圳" }, "answer": "B" } ## Reference - [ceval](https://huggingface.co/datasets/ceval/ceval-exam)

提供机构：

maas

创建时间：

2026-07-04

搜集汇总

数据集介绍

以上内容由遇见数据集搜集并总结生成