gitgood/git_good_bench-train
收藏Hugging Face2025-03-31 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/gitgood/git_good_bench-train
下载链接
链接失效反馈官方服务:
资源简介:
GitGoodBench Lite是一个由17469个样本组成的子集,用于收集AI代理解决git任务的轨迹(见支持的场景),用于模型训练目的。我们支持Python、Java和Kotlin编程语言以及合并冲突解决和文件提交链的样本类型。数据集中的所有数据都是从816个独特的、具有宽松许可的开源GitHub仓库中收集的,这些仓库拥有>=1000个star、>=5个分支、>=10个贡献者且不是分叉或存档的。我们使用[SEART](https://seart-ghs.si.usi.ch/)初始化仓库列表。
GitGoodBench Lite is a subset of 17469 samples for collecting trajectories of AI agents resolving git tasks (see Supported Scenarios) for model training purposes. We support the programming languages Python, Java, and Kotlin, and the sample types merge conflict resolution and file-commit chain. All data in this dataset are collected from 816 unique, open-source GitHub repositories with permissive licenses that have >=1000 stars, >=5 branches, >=10 contributors and are not a fork or archived. We collected the initial list of repositories using [SEART](https://seart-ghs.si.usi.ch/).
提供机构:
gitgood



