AmazonScience/SWE-PolyBench_Verified
收藏Hugging Face2025-12-11 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/AmazonScience/SWE-PolyBench_Verified
下载链接
链接失效反馈官方服务:
资源简介:
SWE-PolyBench是一个多语言的软件工程基准测试,目前包括Python、Java、JavaScript和TypeScript四种语言。验证拆分中每种语言的实例数量分别为:JavaScript 100个,TypeScript 100个,Python 113个,Java 71个。SWE-PolyBench下共有三个数据集:完整数据集、含有500个实例的分层抽样数据集和含有394个实例的验证数据集。数据集结构详细,包括实例ID、补丁、仓库、基准提交、问题提示文本、创建日期、测试补丁、问题陈述、F2P、P2P、编程语言、Dockerfile、测试命令、任务类别以及用于评估的多个布尔型和整型字段。
SWE-PolyBench is a multi-language software engineering benchmark that currently includes Python, Java, Javascript, and Typescript. The number of instances in the verified split are: 100 for Javascript, 100 for TypeScript, 113 for Python, and 71 for Java. There are three datasets under SWE-PolyBench: the full dataset, a stratified sampled dataset with 500 instances, and a verified dataset with 394 instances. The dataset structure is detailed, including fields such as instance_id, patch, repo, base_commit, hints_text, created_at, test_patch, problem_statement, F2P, P2P, language, Dockerfile, test_command, task_category, and several boolean and integer fields for evaluation purposes.
提供机构:
AmazonScience



