cc-sbu-align

arXiv, indexed 2025-09-30

Download link:

https://github.com/haotian-liu/LLaVA


Resource overview:

The cc-sbu-align dataset contains 3,500 detailed image-description pairs. It was used for visual instruction tuning in MiniGPT-4, and it also serves as a clean training set for fine-tuning vision-language models. Its task type is visual instruction tuning.


Background overview

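LLaVA is a multimodal assistant project that combines a large language model with a vision model. It supports visual instruction tuning and several evaluation methods. The project provides model weights, training scripts, and related resources for a range of application scenarios.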

The content above was collected and summarized by 遇见数据集.
