SWE-Sharp-Bench

Name: SWE-Sharp-Bench
Creator: maas
Published: 2026-01-06 16:51:16
License: No description provided

ModelScope Community | Updated: 2026-01-06 | Indexed: 2025-11-08

Download link:

https://modelscope.cn/datasets/microsoft/SWE-Sharp-Bench

Resource description:

# SWE-Sharp-Bench

SWE-Sharp-Bench is a comprehensive benchmark suite for evaluating the software engineering capabilities of AI agents and models on C# and .NET codebases. The benchmark extends the SWE-Bench framework to the C# ecosystem, providing real-world software engineering tasks drawn from popular open-source repositories.

Code: https://github.com/microsoft/prose/tree/main/misc/SWE-Sharp-Bench

Research Paper Draft & Benchmark Analysis: https://aka.ms/swesharparxiv

## Contact

If you have suggestions or questions, please raise an issue on GitHub or contact us at t-smhatre@microsoft.com.

## Citation

If you find our work helpful for your research, please consider citing it.

```
@misc{mhatre2025swesharpbenchreproduciblebenchmarkc,
  title={SWE-Sharp-Bench: A Reproducible Benchmark for C# Software Engineering Tasks},
  author={Sanket Mhatre and Yasharth Bajpai and Sumit Gulwani and Emerson Murphy-Hill and Gustavo Soares},
  year={2025},
  eprint={2511.02352},
  archivePrefix={arXiv},
  primaryClass={cs.SE},
  url={https://arxiv.org/abs/2511.02352},
}
```

Provider:

maas

Created:

2025-11-04
