five

SWE-Swiss/SWESwiss-SFT-Localization-5K

收藏
Hugging Face2025-09-28 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/SWE-Swiss/SWESwiss-SFT-Localization-5K
下载链接
链接失效反馈
官方服务:
资源简介:
SWE-Swiss数据集用于训练SWE-Swiss模型进行本地化任务。该数据集的提示是从SWE-Gym-Raw数据集和一个SWE-bench训练集的子集中构建的。为了防止数据泄露,过滤掉了在SWE-bench测试集中出现的任何仓库。响应是由DeepSeek-R1-0528模型生成的。只有当模型的预测满足两个条件时,一个实例才会被包含在最终数据集中:预测的文件数量不超过五个,且召回率为1.0(即正确识别所有真实文件)。

The SWE-Swiss dataset is used for training SWE-Swiss models on the localization task. The prompts are constructed from a subset of issues in the SWE-Gym-Raw dataset and the SWE-bench training set. Any repositories that also appear in the SWE-bench test set have been filtered out to prevent data leakage. The responses are generated by the DeepSeek-R1-0528 model. An instance is included in the final dataset only if the models prediction meets two conditions: the number of predicted files is five or fewer, and the recall rate is 1.0 (meaning all ground-truth files are correctly identified).
提供机构:
SWE-Swiss
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作