LLaVA-v1.5-mix

Name: LLaVA-v1.5-mix
Creator: OpenAI
License: 暂无描述

arXiv2025-09-30 收录

下载链接：

https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K/blob/main/llava_v1_5_mix665k.json

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是一个混合型数据集，它结合了COCO 2017、GQA、OCR-VQA、TextVQA和VisualGenome的数据，主要用于视觉-语言模型的指令微调。此外，该数据集包含了66.5万个条目，这对于提升模型在遵循指令任务中的性能至关重要。该任务旨在对视觉-语言模型进行指令微调。

This is a hybrid dataset that combines data from COCO 2017, GQA, OCR-VQA, TextVQA, and VisualGenome, and is primarily intended for instruction tuning of vision-language models. Furthermore, this dataset comprises 665,000 entries, which is critical for enhancing the model's performance on instruction-following tasks. This task aims to perform instruction tuning on vision-language models.

提供机构：

OpenAI

搜集汇总

数据集介绍

背景与挑战

背景概述

LLaVA-v1.5-mix是一个用于视觉问答和问答任务的英语数据集，包含665K的数据混合，数据规模在100K到1M之间，采用cc-by-4.0许可证。文件大小为1.03 GB，存储为Xet指针格式。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集