SEED-Bench-2
Source: huggingface.co (indexed 2025-03-24)
Download link:
https://huggingface.co/datasets/AILab-CVC/SEED-Bench-2
Resource overview:
SEED-Bench Card
Benchmark details
Benchmark type:
SEED-Bench-2 is a comprehensive large-scale benchmark for evaluating Multimodal Large Language Models (MLLMs), featuring 24K multiple-choice questions with precise human annotations.
It spans 27 evaluation dimensions, assessing both text and image generation.
Benchmark date:
SEED-Bench was collected in November 2023.
Paper or resources for more information:
https://github.com/AILab-CVC/SEED-Bench
License:… See the full description on the dataset page: https://huggingface.co/datasets/AILab-CVC/SEED-Bench-2.
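The card describes multiple-choice questions grouped into evaluation dimensions. As a rough illustration of how results on such a benchmark might be summarized, here is a minimal sketch of per-dimension accuracy. The record keys (`dimension`, `answer`, `prediction`) and the dimension labels are hypothetical and chosen for this example only; they are not taken from the dataset's actual schema or from the official evaluation code.

```python
from collections import defaultdict


def accuracy_by_dimension(records):
    """Return the fraction of correct predictions for each dimension.

    Each record is a dict with hypothetical keys (not the real schema):
      'dimension'  - label of the evaluation dimension
      'answer'     - ground-truth choice letter, e.g. 'A'
      'prediction' - choice letter produced by the model
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for record in records:
        dim = record["dimension"]
        total[dim] += 1
        # Compare choice letters, ignoring case and surrounding whitespace.
        if record["prediction"].strip().upper() == record["answer"].strip().upper():
            correct[dim] += 1
    return {dim: correct[dim] / total[dim] for dim in total}


# Toy records with made-up dimension labels.
records = [
    {"dimension": "dim_a", "answer": "A", "prediction": "a"},
    {"dimension": "dim_a", "answer": "C", "prediction": "B"},
    {"dimension": "dim_b", "answer": "D", "prediction": "D "},
]

print(accuracy_by_dimension(records))
```

Reporting accuracy per dimension rather than as one overall number keeps a strong result in one area from hiding a weak result in another.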
Provider:
huggingface.co



