SWE-bench_Multimodal
收藏魔搭社区2025-11-27 更新2025-05-10 收录
下载链接:
https://modelscope.cn/datasets/SWE-bench/SWE-bench_Multimodal
下载链接
链接失效反馈官方服务:
资源简介:
# SWE-bench Multimodal
SWE-bench Multimodal is a dataset of 617 task instances that evalutes Language Models and AI Systems on their ability to resolve real world GitHub issues.
To learn more about the dataset, please visit [our website](https://swebench.com/multimodal).
You can find the leaderboard at SWE-bench's [home page](https://www.swebench.com/#multimodal).
# SWE-bench Multimodal
SWE-bench Multimodal 是一个包含617个任务实例的数据集,用于评估大语言模型(Large Language Model,LLM)与AI系统解决真实场景下GitHub议题的能力。
如需了解该数据集的更多详情,请访问[我们的官方网站](https://swebench.com/multimodal)。你可在SWE-bench的[首页](https://www.swebench.com/#multimodal)查看排行榜。
提供机构:
maas
创建时间:
2025-05-08



