five

opencompass/Creation-MMBench

收藏
Hugging Face2025-03-19 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/opencompass/Creation-MMBench
下载链接
链接失效反馈
官方服务:
资源简介:
Creation-MMBench是一个专门为评估多模态大型语言模型(MLLM)创造力而设计的多模态基准数据集。它包含765个测试用例,跨越51个细粒度任务,为MLLM提供了图像和上下文信息,包括角色、背景信息和指令等。数据集还包含了为每个测试用例精心设计的实例特定标准,以评估模型生成的内容的通用响应质量和视觉事实对齐性。此外,该数据集还提供了丰富的创造性问题,并采用多图像格式,每个问题都设计有特定角色,以激发MLLM的创造力。

Creation-MMBench is a multimodal benchmark specifically designed to evaluate the creative capabilities of Multimodal Large Language Models (MLLMs). It features 765 test cases spanning 51 fine-grained tasks, providing MLLMs with images and context, including role, background information, and instructions. The dataset includes carefully crafted instance-specific criteria for each test case to assess both general response quality and visual-factual alignment in model-generated content. Additionally, the dataset offers a rich set of creative questions and adopts a multi-image format, with each question designed to stimulate MLLMs creative capabilities through specific roles.
提供机构:
opencompass
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作