SWE-Bench_Lite_UTBoost
收藏魔搭社区2025-12-04 更新2025-07-05 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/SWE-Bench_Lite_UTBoost
下载链接
链接失效反馈官方服务:
资源简介:
**Dataset Summary**
In this dataset, we replace some test suites in princeton-nlp/SWE-bench_Verified with augmented test cases to enable a more rigorous evaluation of SWE-Bench. UTBoost was accepted in ACL 2025, and we have opened-sourced the code and metadata in https://github.com/CUHK-Shenzhen-SE/UTBoost.
**Dataset Structure**
An example of a SWE-bench datum is as follows:
```
instance_id: (str) - A formatted instance identifier, usually as repo_owner__repo_name-PR-number.
patch: (str) - The gold patch, the patch generated by the PR (minus test-related code), that resolved the issue.
repo: (str) - The repository owner/name identifier from GitHub.
base_commit: (str) - The commit hash of the repository representing the HEAD of the repository before the solution PR is applied.
hints_text: (str) - Comments made on the issue prior to the creation of the solution PR’s first commit creation date.
created_at: (str) - The creation date of the pull request.
test_patch: (str) - A test-file patch that was contributed by the solution PR.
problem_statement: (str) - The issue title and body.
version: (str) - Installation version to use for running evaluation.
environment_setup_commit: (str) - commit hash to use for environment setup and installation.
FAIL_TO_PASS: (str) - A json list of strings that represent the set of tests resolved by the PR and tied to the issue resolution.
PASS_TO_PASS: (str) - A json list of strings that represent tests that should pass before and after the PR application.
```
**数据集概述**
本数据集通过将princeton-nlp/SWE-bench_Verified中的部分测试套件替换为增强测试用例,以实现对SWE-Bench更为严谨的评估。本次工作的UTBoost已被ACL 2025收录,相关代码与元数据已开源至https://github.com/CUHK-Shenzhen-SE/UTBoost。
**数据集结构**
SWE-bench单条数据示例如下:
实例ID(instance_id): (str) - 格式化后的实例标识符,通常格式为repo_owner__repo_name-PR-number。
补丁(patch): (str) - 解决该问题的拉取请求(Pull Request,简称PR)所生成的黄金补丁(剔除与测试相关的代码)。
仓库(repo): (str) - 来自GitHub的仓库所有者/名称标识符。
基础提交哈希(base_commit): (str) - 代表应用解决方案PR前的仓库HEAD对应的提交哈希值。
提示文本(hints_text): (str) - 即在解决方案PR的首次提交创建日期之前,针对该议题(issue)留下的评论内容。
创建时间(created_at): (str) - 该拉取请求的创建日期。
测试补丁(test_patch): (str) - 由解决方案PR提交的测试文件补丁。
问题描述(problem_statement): (str) - 该议题(issue)的标题与正文内容。
版本(version): (str) - 运行评估时所使用的安装版本。
环境设置提交哈希(environment_setup_commit): (str) - 用于环境搭建与安装的提交哈希值。
FAIL_TO_PASS: (str) - 一个JSON格式的字符串列表,代表该PR所解决且与该议题(issue)问题解决相关的测试用例集合。
PASS_TO_PASS: (str) - 一个JSON格式的字符串列表,代表在应用PR前后均应通过的测试用例集合。
提供机构:
maas
创建时间:
2025-07-04



