LLaVA-NeXT-Interleave-Bench

Name: LLaVA-NeXT-Interleave-Bench
Creator: maas
Published: 2025-11-14 16:17:26
License: 暂无描述

魔搭社区2025-11-14 更新2024-11-23 收录

下载链接：

https://modelscope.cn/datasets/lmms-lab/LLaVA-NeXT-Interleave-Bench

下载链接

链接失效反馈

官方服务：

资源简介：

# LLaVA-Interleave Bench Dataset Card ## Dataset details **Dataset type:** LLaVA-Interleave Bench is a comprehensive set of multi-image datasets that are collected from public datasets or generated by the GPT-4V API. It is constructed for evaluating the interleaved multi-image reaoning capbilities of LMMs. **Dataset date:** LLaVA-Interleave Bench was collected in April 2024, and released in June 2024. **Paper or resources for more information:** Blog: https://llava-vl.github.io/blog/2024-06-16-llava-next-interleave/ **Evaluate LLaVA-NeXT Interleave Model** ### Preparation Please download the evaluation data first. Unzip eval_images.zip and there are Split1 and Split2 in it. Organize the downloaded data into the following structure: ``` interleave_data ├── Split1 │ ├── ... │ └── ... | ├── Split2 | ├── ... │ └── ... ├── multi_image_in_domain.json ├── multi_image_out_domain.json └── multi_view_in_domain.json ``` ### Inference and Evaluation Example: Please first edit /path/to/ckpt to the path of checkpoint, /path/to/images to the path of "interleave_data" in scripts/interleave/eval_all.sh and then run ```bash bash scripts/interleave/eval_all.sh ``` Note that, the MMMU-mv data is currently not included in the released data. We will release it soon. To construct MMMU-mv, we sample the 789 questions with multiple input images from the [official test set](https://huggingface.co/datasets/MMMU/MMMU) of MMMU. **License:** Creative Commons Attribution 4.0 International; and it should abide by the policy of OpenAI: https://openai.com/policies/terms-of-use **Where to send questions or comments about the model:** fliay@connect.ust.hk ## Intended use **Primary intended uses:** The primary use of LLaVA-Next Interleave is research on large multimodal models and chatbots. **Primary intended users:** The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.

# LLaVA-Interleave Bench 数据集卡片 ## 数据集详情 **数据集类型：** LLaVA-Interleave Bench 是一套综合性多图像数据集，其数据源自公开数据集或通过 GPT-4V API 生成，旨在评估多模态大模型（Large Multimodal Model, LMM）的交错多图像推理能力。 **数据集采集与发布时间：** 本数据集于2024年4月完成采集，2024年6月正式发布。 **更多信息参考的论文或资源：** 博客：https://llava-vl.github.io/blog/2024-06-16-llava-next-interleave/ **评估 LLaVA-NeXT Interleave 模型** ### 准备工作请先下载评估数据。解压 eval_images.zip 后，内部将包含 Split1 与 Split2 两个子目录。请将下载的数据按如下结构组织： interleave_data ├── Split1 │ ├── ... │ └── ... | ├── Split2 | ├── ... │ └── ... ├── multi_image_in_domain.json ├── multi_image_out_domain.json └── multi_view_in_domain.json ### 推理与评估示例：请先在 scripts/interleave/eval_all.sh 脚本中，将 /path/to/ckpt 修改为模型检查点路径，将 /path/to/images 修改为上述 interleave_data 目录的路径，随后运行如下命令： bash bash scripts/interleave/eval_all.sh 注意：当前发布的数据未包含 MMMU-mv 数据集，我们将尽快推出该数据集的发布。MMMU-mv 的构建方式为：从 MMMU 的[官方测试集](https://huggingface.co/datasets/MMMU/MMMU)中采样789道包含多幅输入图像的问题。 **许可证：** 知识共享署名4.0国际许可协议（Creative Commons Attribution 4.0 International）；同时需遵守 OpenAI 的相关政策：https://openai.com/policies/terms-of-use **问题与意见反馈渠道：** fliay@connect.ust.hk ## 预期用途 **主要预期用途：** LLaVA-Next Interleave 的核心用途为多模态大模型与聊天机器人相关的研究工作。 **主要目标用户群体：** 本模型的主要目标用户为计算机视觉、自然语言处理、机器学习与人工智能领域的研究人员与爱好者。

提供机构：

maas

创建时间：

2024-10-07

5,000+

优质数据集

54 个

任务类型

进入经典数据集