anith-deminchong/LLaVA-Instruct-150K
收藏Hugging Face2026-03-13 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/anith-deminchong/LLaVA-Instruct-150K
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-4.0
task_categories:
- visual-question-answering
- question-answering
language:
- en
pretty_name: LLaVA Visual Instruct 150K
size_categories:
- 100K<n<1M
---
# LLaVA Visual Instruct 150K Dataset Card
## Dataset details
**Dataset type:**
LLaVA Visual Instruct 150K is a set of GPT-generated multimodal instruction-following data.
It is constructed for visual instruction tuning and for building large multimodal towards GPT-4 vision/language capability.
**Dataset date:**
LLaVA Visual Instruct 150K was collected in April 2023, by prompting GPT-4-0314 API.
**Paper or resources for more information:**
https://llava-vl.github.io/
**License:**
Creative Commons Attribution 4.0 International; and it should abide by the policy of OpenAI: https://openai.com/policies/terms-of-use
**Where to send questions or comments about the model:**
https://github.com/haotian-liu/LLaVA/issues
## Intended use
**Primary intended uses:**
The primary use of LLaVA is research on large multimodal models and chatbots.
**Primary intended users:**
The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
许可证:CC BY 4.0
任务类别:
- 视觉问答
- 问答
语言:
- 英语
数据集名称:LLaVA视觉指令150K
数据规模:
- 100K<n<1M
---
# LLaVA视觉指令150K 数据集卡片
## 数据集详情
**数据集类型:**
LLaVA视觉指令150K是一套由GPT生成的多模态指令遵循数据集,专为视觉指令微调以及构建具备GPT-4视觉-语言能力的大型多模态模型而打造。
**数据集采集时间:**
LLaVA视觉指令150K于2023年4月通过调用GPT-4-0314 API完成采集。
**相关论文或参考资源:**
https://llava-vl.github.io/
**许可证:**
知识共享署名4.0国际许可协议;同时需遵守OpenAI的使用条款:https://openai.com/policies/terms-of-use
**数据集相关问题或意见反馈渠道:**
https://github.com/haotian-liu/LLaVA/issues
## 预期用途
**核心预期用途:**
本数据集的核心用途为大型多模态模型及聊天机器人的相关研究。
**核心目标用户:**
本数据集的主要使用者为计算机视觉、自然语言处理、机器学习以及人工智能领域的研究人员与爱好者。
提供机构:
anith-deminchong



