SWE-bench_Multimodal

Name: SWE-bench_Multimodal
Creator: maas
Published: 2025-11-27 16:32:48
License: No description available

ModelScope community. Updated 2025-11-27; indexed 2025-05-10.

Download link:

https://modelscope.cn/datasets/SWE-bench/SWE-bench_Multimodal

Description:

# SWE-bench Multimodal

SWE-bench Multimodal is a dataset of 617 task instances that evaluates language models and AI systems on their ability to resolve real-world GitHub issues. To learn more about the dataset, please visit [our website](https://swebench.com/multimodal). You can find the leaderboard on SWE-bench's [home page](https://www.swebench.com/#multimodal).

Provider:

maas

Created:

2025-05-08
