five

SWE-Bench_Lite_UTBoost

收藏
魔搭社区2025-12-04 更新2025-07-05 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/SWE-Bench_Lite_UTBoost
下载链接
链接失效反馈
官方服务:
资源简介:
**Dataset Summary** In this dataset, we replace some test suites in princeton-nlp/SWE-bench_Verified with augmented test cases to enable a more rigorous evaluation of SWE-Bench. UTBoost was accepted in ACL 2025, and we have opened-sourced the code and metadata in https://github.com/CUHK-Shenzhen-SE/UTBoost. **Dataset Structure** An example of a SWE-bench datum is as follows: ``` instance_id: (str) - A formatted instance identifier, usually as repo_owner__repo_name-PR-number. patch: (str) - The gold patch, the patch generated by the PR (minus test-related code), that resolved the issue. repo: (str) - The repository owner/name identifier from GitHub. base_commit: (str) - The commit hash of the repository representing the HEAD of the repository before the solution PR is applied. hints_text: (str) - Comments made on the issue prior to the creation of the solution PR’s first commit creation date. created_at: (str) - The creation date of the pull request. test_patch: (str) - A test-file patch that was contributed by the solution PR. problem_statement: (str) - The issue title and body. version: (str) - Installation version to use for running evaluation. environment_setup_commit: (str) - commit hash to use for environment setup and installation. FAIL_TO_PASS: (str) - A json list of strings that represent the set of tests resolved by the PR and tied to the issue resolution. PASS_TO_PASS: (str) - A json list of strings that represent tests that should pass before and after the PR application. ```

**数据集概述** 本数据集通过将princeton-nlp/SWE-bench_Verified中的部分测试套件替换为增强测试用例,以实现对SWE-Bench更为严谨的评估。本次工作的UTBoost已被ACL 2025收录,相关代码与元数据已开源至https://github.com/CUHK-Shenzhen-SE/UTBoost。 **数据集结构** SWE-bench单条数据示例如下: 实例ID(instance_id): (str) - 格式化后的实例标识符,通常格式为repo_owner__repo_name-PR-number。 补丁(patch): (str) - 解决该问题的拉取请求(Pull Request,简称PR)所生成的黄金补丁(剔除与测试相关的代码)。 仓库(repo): (str) - 来自GitHub的仓库所有者/名称标识符。 基础提交哈希(base_commit): (str) - 代表应用解决方案PR前的仓库HEAD对应的提交哈希值。 提示文本(hints_text): (str) - 即在解决方案PR的首次提交创建日期之前,针对该议题(issue)留下的评论内容。 创建时间(created_at): (str) - 该拉取请求的创建日期。 测试补丁(test_patch): (str) - 由解决方案PR提交的测试文件补丁。 问题描述(problem_statement): (str) - 该议题(issue)的标题与正文内容。 版本(version): (str) - 运行评估时所使用的安装版本。 环境设置提交哈希(environment_setup_commit): (str) - 用于环境搭建与安装的提交哈希值。 FAIL_TO_PASS: (str) - 一个JSON格式的字符串列表,代表该PR所解决且与该议题(issue)问题解决相关的测试用例集合。 PASS_TO_PASS: (str) - 一个JSON格式的字符串列表,代表在应用PR前后均应通过的测试用例集合。
提供机构:
maas
创建时间:
2025-07-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作