HumanEval

Name: HumanEval
Creator: OpenDataLab
Published: 2026-05-17 04:30:17
License: 暂无描述

OpenDataLab2026-05-17 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/HumanEval

下载链接

链接失效反馈

官方服务：

资源简介：

这是论文“Evaluating Large Language Models Trained on Code”中描述的 HumanEval 问题解决数据集的评估工具。它用于测量从文档字符串合成程序的功能正确性。它由 164 个原始编程问题组成，评估语言理解、算法和简单的数学，还有一些类似于简单的软件面试问题。

This is the evaluation toolkit for the HumanEval problem-solving dataset described in the paper *Evaluating Large Language Models Trained on Code*. It is designed to measure the functional correctness of programs synthesized from docstrings. It comprises 164 original programming problems that assess language comprehension, algorithmic proficiency, and basic mathematical reasoning, with some items similar to entry-level software engineering interview questions.

提供机构：

OpenDataLab

创建时间：

2022-06-23

搜集汇总

数据集介绍