Visual Instruction Samples
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/jefferyZhan/Griffon/tree/master/Griffon-R
下载链接
链接失效反馈官方服务:
资源简介:
该数据集精心筛选了33.4万个视觉指令样本,覆盖了一般场景和富含文本的场景,旨在提升模型的视觉推理能力。此外,该数据集旨在充分利用大型多模态模型处理复杂视觉推理任务的内禀能力。该数据集规模达到33.4万样本,适用于视觉推理和问答任务。
This dataset carefully curates 334,000 visual instruction samples spanning both general scenarios and text-rich scenes, with the goal of enhancing the visual reasoning capabilities of models. Additionally, this dataset is intended to fully leverage the intrinsic capacities of large multimodal models for complex visual reasoning tasks. Containing 334,000 samples in total, this dataset is suitable for visual reasoning and question answering tasks.



