SWE-smith
收藏魔搭社区2025-12-05 更新2025-05-10 收录
下载链接:
https://modelscope.cn/datasets/SWE-bench/SWE-smith
下载链接
链接失效反馈官方服务:
资源简介:
<div align="center">
<a href="https://swesmith.com/">
<img src="https://avatars.githubusercontent.com/u/189315905?s=200&v=4" alt="Logo" width="200">
<h1 align="center">SWE-smith Dataset</h1>
</a>
</div>
<p align="center">
<a href="https://github.com/SWE-bench/SWE-smith">Code</a>
•
<a href="https://huggingface.co/papers/2504.21798">Paper</a>
•
<a href="https://swesmith.com/">Site</a>
</p>
The SWE-smith Dataset is a training dataset of 50137 task instances from 128 GitHub repositories, collected using the SWE-smith toolkit.
It is the largest dataset to date for training software engineering agents.
All SWE-smith task instances come with an executable environment.
To learn more about how to use this dataset to train Language Models for Software Engineering, please refer to the [documentation](https://swesmith.com/docs).
<div align="center">
<a href="https://swesmith.com/">
<img src="https://avatars.githubusercontent.com/u/189315905?s=200&v=4" alt="Logo" width="200">
<h1 align="center">SWE-smith 数据集(SWE-smith Dataset)</h1>
</a>
</div>
<p align="center">
<a href="https://github.com/SWE-bench/SWE-smith">代码</a>
•
<a href="https://huggingface.co/papers/2504.21798">论文</a>
•
<a href="https://swesmith.com/">官网</a>
</p>
SWE-smith数据集是一款训练数据集,涵盖来自128个GitHub仓库的50137个任务实例,通过SWE-smith工具包(SWE-smith toolkit)采集构建。
该数据集是目前规模最大的用于训练软件工程智能体(software engineering agents)的数据集。
所有SWE-smith任务实例均配套可执行运行环境。
若欲了解更多关于如何使用本数据集训练面向软件工程的大语言模型(Large Language Model)的相关内容,请参阅[官方文档](https://swesmith.com/docs).
提供机构:
maas
创建时间:
2025-05-08



