mlfoundations-dev/swe_gym_with_metadata_etash
收藏Hugging Face2025-04-02 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/mlfoundations-dev/swe_gym_with_metadata_etash
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含Git仓库相关信息的数据集,具体包括仓库repo、pull请求编号、实例ID、关联的问题编号、基础提交记录、补丁内容、测试补丁、问题陈述、提示文本、创建时间等字段。数据集还包含了代码文件的相关信息,如内容、扩展名、文件路径、所有者、仓库、文件大小等。此外,数据集提供了PASS_TO_PASS和FAIL_TO_PASS两个序列字段,但其值为null。数据集被分为训练集,包含约9997个示例,总大小约为170GB。
This dataset is a collection of Git repository information, including fields such as repository name, pull request number, instance ID, associated issue numbers, base commit, patch content, test patch, problem statement, hint text, creation time, etc. The dataset also contains information about code files, such as content, extension, file path, owner, repository, file size, etc. Additionally, there are two sequence fields, PASS_TO_PASS and FAIL_TO_PASS, with the value null. The dataset is split into a training set with approximately 9997 examples, totaling about 170GB in size.
提供机构:
mlfoundations-dev



