lmms-lab/LLaVA-Video-178K

Name: lmms-lab/LLaVA-Video-178K
Creator: lmms-lab
Published: 2024-10-11 04:59:25
License: No description available

Source: Hugging Face. Updated 2024-10-11; indexed 2024-12-14.

Download link:

https://hf-mirror.com/datasets/lmms-lab/LLaVA-Video-178K
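As a minimal sketch of how the link above might be used programmatically: the helper below rebuilds the mirror URL from the repository ID, and an optional function fetches files through the mirror. The function names `dataset_page_url` and `download_snapshot` are hypothetical and not part of this listing. The download path assumes the `huggingface_hub` package is installed and that the mirror honours the `HF_ENDPOINT` environment variable. The dataset contains video data and may be very large, so restricting files with `allow_patterns` is advisable.

```python
import os

REPO_ID = "lmms-lab/LLaVA-Video-178K"
MIRROR = "https://hf-mirror.com"  # mirror host taken from the download link above


def dataset_page_url(repo_id: str, endpoint: str = MIRROR) -> str:
    """Build the dataset page URL for a repository on the given endpoint."""
    return f"{endpoint.rstrip('/')}/datasets/{repo_id}"


def download_snapshot(repo_id: str = REPO_ID, endpoint: str = MIRROR,
                      local_dir=None, allow_patterns=None) -> str:
    """Download repository files through the mirror (hypothetical helper).

    Assumption: HF_ENDPOINT is read when huggingface_hub is first imported,
    so it is set here before the import. If the library was already imported
    elsewhere in the process, set the variable before starting Python instead.
    """
    os.environ["HF_ENDPOINT"] = endpoint
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=repo_id,
        repo_type="dataset",
        local_dir=local_dir,
        allow_patterns=allow_patterns,  # e.g. a list of glob patterns to limit the download
    )


if __name__ == "__main__":
    # Only prints the URL; call download_snapshot() explicitly to fetch files.
    print(dataset_page_url(REPO_ID))
```

Calling `download_snapshot()` is left to the reader because it needs network access and substantial disk space.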

Resource overview:

The LLaVA-Video-178K dataset is a large-scale video-language dataset used for training the LLaVA-Video model, containing 178,510 caption entries, 960,792 open-ended QA items, and 196,198 multiple-choice QA items. The data is sourced from five primary sources: LLaVA-Video-178K, NeXT-QA, ActivityNetQA, PerceptionTest, and LLaVA-Hound. The dataset is primarily used for academic research and educational purposes, under the Apache License 2.0.
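To put the three annotation counts in proportion, the sketch below tallies them and computes each type's share of the total. The counts are copied from the description above; the dictionary keys and the `composition` helper are illustrative names, not field names from the dataset itself.

```python
# Annotation counts as stated in the description above.
COUNTS = {
    "caption": 178_510,
    "open_ended_qa": 960_792,
    "multiple_choice_qa": 196_198,
}


def composition(counts):
    """Return the total number of annotations and each type's share of it."""
    total = sum(counts.values())
    shares = {name: n / total for name, n in counts.items()}
    return total, shares


total, shares = composition(COUNTS)

if __name__ == "__main__":
    print(f"total annotations: {total:,}")
    for name, share in shares.items():
        print(f"{name}: {share:.1%}")
```

The total comes to 1,335,500 annotations, with open-ended QA items making up the large majority.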

Provided by:

lmms-lab

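Collected summary

Background and challenges

Background overview

LLaVA-Video-178K is a large-scale video-language dataset containing more than 178,000 captions, 960,000 open-ended QA items, and 196,000 multiple-choice QA items. The data is drawn from multiple sources and is used to train video-language models. It is intended for academic research and educational purposes and is released under an open-source license.

The content above was collected and summarized by 遇见数据集.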
