SWE-Sharp-Bench
收藏魔搭社区2026-01-06 更新2025-11-08 收录
下载链接:
https://modelscope.cn/datasets/microsoft/SWE-Sharp-Bench
下载链接
链接失效反馈官方服务:
资源简介:
# SWE-Sharp-Bench
SWE-Sharp-Bench is a comprehensive benchmark suite for evaluating software engineering capabilities of AI agents and models on C# and .NET codebases. This benchmark extends the SWE-Bench framework to the C# ecosystem, providing real-world software engineering tasks from popular open-source repositories.
Code - https://github.com/microsoft/prose/tree/main/misc/SWE-Sharp-Bench
Research Paper Draft & Benchmark Analysis: https://aka.ms/swesharparxiv
## Contact
If you have suggestions or any questions, please raise an issue on Github or contact us at t-smhatre@microsoft.com .
## Citation
If you find our work helpful for your research, please consider citing our work.
```
@misc{mhatre2025swesharpbenchreproduciblebenchmarkc,
title={SWE-Sharp-Bench: A Reproducible Benchmark for C# Software Engineering Tasks},
author={Sanket Mhatre and Yasharth Bajpai and Sumit Gulwani and Emerson Murphy-Hill and Gustavo Soares},
year={2025},
eprint={2511.02352},
archivePrefix={arXiv},
primaryClass={cs.SE},
url={https://arxiv.org/abs/2511.02352},
}
```
# SWE-Sharp-Bench
SWE-Sharp-Bench是一套用于评估AI智能体(AI Agent)与大语言模型(Large Language Model,LLM)在C#及.NET代码库上软件工程能力的综合基准测试套件。本基准将SWE-Bench框架拓展至C#生态系统,提供源自热门开源仓库的真实软件工程任务。
代码仓库:https://github.com/microsoft/prose/tree/main/misc/SWE-Sharp-Bench
研究论文草案与基准测试分析:https://aka.ms/swesharparxiv
## 联系我们
若您有任何建议或疑问,请在GitHub上提交Issue,或通过邮箱t-smhatre@microsoft.com与我们取得联系。
## 引用
若您的研究工作得益于本项目,请考虑引用我们的成果。
@misc{mhatre2025swesharpbenchreproduciblebenchmarkc,
title={SWE-Sharp-Bench: A Reproducible Benchmark for C# Software Engineering Tasks},
author={Sanket Mhatre and Yasharth Bajpai and Sumit Gulwani and Emerson Murphy-Hill and Gustavo Soares},
year={2025},
eprint={2511.02352},
archivePrefix={arXiv},
primaryClass={cs.SE},
url={https://arxiv.org/abs/2511.02352},
}
提供机构:
maas
创建时间:
2025-11-04



