VCGBench-Diverse

Name: VCGBench-Diverse
Creator: maas
Published: 2025-11-27 16:26:29
License: 暂无描述

魔搭社区2025-11-27 更新2025-03-22 收录

下载链接：

https://modelscope.cn/datasets/MBZUAI/VCGBench-Diverse

下载链接

链接失效反馈

官方服务：

资源简介：

# 👁️ VCGBench-Diverse Benchmarks --- ## 📝 Description Recognizing the limited diversity in existing video conversation benchmarks, we introduce VCGBench-Diverse to comprehensively evaluate the generalization ability of video LMMs. While VCG-Bench provides an extensive evaluation protocol, it is limited to videos from the ActivityNet200 dataset. Our benchmark comprises a total of 877 videos, 18 broad video categories and 4,354 QA pairs, ensuring a robust evaluation framework. <p align="center"> <img src="vcgbench_diverse.png" alt="Contributions"> </p> ## Dataset Contents 1. `vcgbench_diverse_qa.json` - Contains VCGBench-Diverse question-answer pairs. 2. `videos.tar.gz` - Contains the videos corresponding to `vcgbench_diverse_qa.json`. 3. `human_annotated_video_descriptions` - Contains original human-annotated dense descriptions of the videos. 4. `gpt_evaluation_scripts` - Contains the GPT-3.5-Turbo evaluation scripts to evaluate a model's predictions. 5. `sample_predictions` - Contains the VideoGPT+ predictions on the VCGBench-Diverse. Compatible with `gpt_evaluation_scripts`. In order to evaluate your model on `VCGBench-Diverse`, use question-answer pairs in `vcgbench_diverse_qa.json` to generate your model's predictions in format same as `sample_predictions` and then use `gpt_evaluation_scripts` for the evalution. ## 💻 Download To get started, follow these steps: ``` git lfs install git clone https://huggingface.co/MBZUAI/VCGBench-Diverse ``` ## 📚 Additional Resources - **Paper:** [ArXiv](https://arxiv.org/abs/2406.09418). - **GitHub Repository:** For training and updates: [GitHub](https://github.com/mbzuai-oryx/VideoGPT-plus). - **HuggingFace Collection:** For downloading the pretrained checkpoints, VCGBench-Diverse Benchmarks and Training data, visit [HuggingFace Collection - VideoGPT+](https://huggingface.co/collections/MBZUAI/videogpt-665c8643221dda4987a67d8d). ## 📜 Citations and Acknowledgments ```bibtex @article{Maaz2024VideoGPT+, title={VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding}, author={Maaz, Muhammad and Rasheed, Hanoona and Khan, Salman and Khan, Fahad Shahbaz}, journal={arxiv}, year={2024}, url={https://arxiv.org/abs/2406.09418} }

# 👁️ VCGBench-Diverse基准测试集 --- ## 📝 数据集概述针对现有视频对话基准测试集多样性不足的问题，我们提出VCGBench-Diverse，以全面评估视频大语言模型（Large Language Model, LLM）的泛化能力。尽管VCG-Bench已提供了较为完善的评估协议，但其仅局限于使用ActivityNet200数据集的视频。本基准测试集共计包含877段视频、18个宽泛的视频类别以及4354组问答对（Question-Answering pairs, QA对），可构建起一套稳健的评估框架。 <p align="center"> <img src="vcgbench_diverse.png" alt="贡献说明"> </p> ## 数据集内容 1. `vcgbench_diverse_qa.json` - 包含VCGBench-Diverse的问答对数据。 2. `videos.tar.gz` - 包含与`vcgbench_diverse_qa.json`对应的视频文件。 3. `human_annotated_video_descriptions` - 包含人工标注的视频详细描述文本。 4. `gpt_evaluation_scripts` - 包含用于评估模型预测结果的GPT-3.5-Turbo评估脚本。 5. `sample_predictions` - 包含VideoGPT+在VCGBench-Diverse上的模型预测结果，可与`gpt_evaluation_scripts`兼容使用。若要在VCGBench-Diverse上评估你的模型，请使用`vcgbench_diverse_qa.json`中的问答对生成模型预测结果，预测格式需与`sample_predictions`保持一致，随后通过`gpt_evaluation_scripts`完成评估流程。 ## 💻 下载方式若要开始使用，请按照以下步骤操作： git lfs install git clone https://huggingface.co/MBZUAI/VCGBench-Diverse ## 📚 附加资源 - **论文**：[ArXiv](https://arxiv.org/abs/2406.09418)。 - **GitHub仓库**：用于获取训练相关内容与更新：[GitHub](https://github.com/mbzuai-oryx/VideoGPT-plus)。 - **HuggingFace合集**：若要下载预训练检查点（checkpoint）、VCGBench-Diverse基准测试集与训练数据，请访问 [HuggingFace合集 - VideoGPT+](https://huggingface.co/collections/MBZUAI/videogpt-665c8643221dda4987a67d8d)。 ## 📜 引用与致谢 bibtex @article{Maaz2024VideoGPT+, title={VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding}, author={Maaz, Muhammad and Rasheed, Hanoona and Khan, Salman and Khan, Fahad Shahbaz}, journal={arxiv}, year={2024}, url={https://arxiv.org/abs/2406.09418} }

提供机构：

maas

创建时间：

2025-03-17

5,000+

优质数据集

54 个

任务类型

进入经典数据集