FudanSELab/ClassEval

Name: FudanSELab/ClassEval
Creator: FudanSELab
Published: 2024-07-05 02:05:14
License: 暂无描述

Hugging Face2024-07-05 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/FudanSELab/ClassEval

下载链接

链接失效反馈

官方服务：

资源简介：

ClassEval数据集是一个手动构建的基准数据集，用于评估大型语言模型在类级别代码生成任务上的性能。该数据集包含100个Python类级别的编码任务，涵盖了管理信息系统、数据格式化、数学运算、游戏开发、文件处理、数据库操作和自然语言处理等多个主题。每个任务包含类骨架、测试用例、解决方案代码等详细信息。数据集仅包含一个测试分割，包含100个样本。

ClassEval is a manually-crafted benchmark dataset for evaluating the performance of large language models on class-level code generation tasks. The dataset consists of 100 class-level Python coding tasks, covering a wide range of topics including Management Systems, Data Formatting, Mathematical Operations, Game Development, File Handling, Database Operations, and Natural Language Processing. Each task includes detailed information such as class skeleton, test cases, and solution code. The dataset only contains a test split with 100 samples.

提供机构：

FudanSELab

原始信息汇总

数据集卡片 for FudanSELab ClassEval

数据集描述

数据集概述

我们手动构建了包含100个类级别Python编程任务的ClassEval数据集，包含100个类和412个方法，平均每个类有33.1个测试用例。这些任务涵盖了广泛的主题，包括管理系统、数据格式化、数学运算、游戏开发、文件处理、数据库操作和自然语言处理。

语言

编程语言为Python，注释和文档字符串使用的自然语言为英语。

数据集结构

python from datasets import load_dataset dataset = load_dataset("FudanSELab/ClassEval")

DatasetDict({ test: Dataset({ features: [task_id, skeleton, test, solution_code, import_statement, class_description, methods_info, class_name, test_classes, class_constructor, fields], num_rows: 100 }) })

数据字段

每个任务的具体数据字段如下：

task_id: 每个任务的唯一标识符。
skeleton: 类骨架，包括所有输入描述。
test: 整个类的所有测试用例。
solution_code: 每个任务的类级别代码。
import_statement: 每个任务的导入语句。
class_name: 类的名称。
class_description: 类的目的和功能描述。
class_constructor: 类的构造函数。
fields: 类构造函数中定义的字段。
methods_info: 包含每个方法的详细信息，包括：
- method_name: 方法签名。
- method_input: 方法合约设计，包括所有输入描述。
- test_code: 方法的测试用例。
- solution_code: 方法级别的代码。
- dependencies: 方法的依赖信息。

数据分割

数据集仅包含一个测试分割，包含100个样本。

数据集创建

源数据

手动构建

附加信息

许可信息

该数据集遵循MIT许可证。

引用信息

@misc{du2023classeval, title={ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation}, author={Xueying Du and Mingwei Liu and Kaixin Wang and Hanlin Wang and Junwei Liu and Yixuan Chen and Jiayi Feng and Chaofeng Sha and Xin Peng and Yiling Lou}, year={2023}, eprint={2308.01861}, archivePrefix={arXiv}, primaryClass={cs.CL} }