SEED-Bench-2

Name: SEED-Bench-2
Creator: huggingface.co
License: 暂无描述

huggingface.co2025-03-24 收录

下载链接：

https://huggingface.co/datasets/AILab-CVC/SEED-Bench-2

下载链接

链接失效反馈

官方服务：

资源简介：

SEED-Bench Card Benchmark details Benchmark type: SEED-Bench-2 is a comprehensive large-scale benchmark for evaluating Multimodal Large Language Models (MLLMs), featuring 24K multiple-choice questions with precise human annotations. It spans 27 evaluation dimensions, assessing both text and image generation. Benchmark date: SEED-Bench was collected in November 2023. Paper or resources for more information: https://github.com/AILab-CVC/SEED-Bench License:… See the full description on the dataset page: https://huggingface.co/datasets/AILab-CVC/SEED-Bench-2.

SEED-Bench 卡片 ## 基准测试细节基准测试类型：SEED-Bench-2 是一个针对多模态大语言模型（MLLMs）的全面大规模基准测试，包含带有精确人工标注的 24K 个多选题。它涵盖了 27 个评估维度，对文本和图像生成能力进行综合评估。基准测试日期：SEED-Bench 于 2023 年 11 月收集。更多信息和资源：[SEED-BenchGitHub 仓库](https://github.com/AILab-CVC/SEED-Bench) 许可：……请参阅数据集页面上的完整描述：[Hugging Face 数据集页面](https://huggingface.co/datasets/AILab-CVC/SEED-Bench-2).

提供机构：

huggingface.co

5,000+

优质数据集

54 个

任务类型

进入经典数据集