opencompass/Creation-MMBench

Name: opencompass/Creation-MMBench
Creator: opencompass
Published: 2025-03-19 11:44:29
License: No description available

Hugging Face, updated 2025-03-19, indexed 2025-04-12

Download link:

https://hf-mirror.com/datasets/opencompass/Creation-MMBench

Description:

Creation-MMBench is a multimodal benchmark designed to evaluate the creative capabilities of Multimodal Large Language Models (MLLMs). It comprises 765 test cases spanning 51 fine-grained tasks. Each test case supplies the model with images and context: a role, background information, and instructions. Each test case also comes with carefully designed instance-specific criteria that assess both the general response quality and the visual-factual alignment of the generated content. The questions are creative in nature, use a multi-image format, and each assigns a specific role intended to stimulate the MLLM's creativity.
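
The structure described above (images plus role, background, and instruction, with two kinds of per-instance criteria) can be pictured with a minimal sketch. The class name, field names, and example values below are illustrative assumptions and are not taken from the dataset's actual schema; consult the dataset files for the real column names.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CreationTestCase:
    """Hypothetical record layout; every field name here is an assumption."""
    task: str                      # one of the 51 fine-grained tasks
    role: str                      # persona the model is asked to adopt
    background: str                # background information for the scenario
    instruction: str               # what the model should produce
    image_paths: List[str] = field(default_factory=list)  # multi-image input
    general_criteria: str = ""     # instance-specific response-quality criteria
    visual_criteria: str = ""      # instance-specific visual-factuality criteria


def build_prompt(case: CreationTestCase) -> str:
    """Join the textual context into one prompt; images are passed separately."""
    parts = [
        f"Role: {case.role}",
        f"Background: {case.background}",
        f"Instruction: {case.instruction}",
    ]
    return "\n".join(parts)


def average_cases_per_task(num_cases: int = 765, num_tasks: int = 51) -> float:
    """Average number of test cases per task, using the counts stated above."""
    return num_cases / num_tasks


# Made-up example values, for illustration only.
example = CreationTestCase(
    task="example-task",
    role="You are a museum guide.",
    background="Visitors are looking at the two paintings shown.",
    instruction="Write a short, engaging introduction to both works.",
    image_paths=["image_1.jpg", "image_2.jpg"],
)

prompt = build_prompt(example)
avg = average_cases_per_task()
```

With the stated counts, the average works out to 15 test cases per task; this is only an average and says nothing about how cases are actually distributed across tasks.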

Provider:

opencompass
