Vision-CAIR/VLV-Benchmark

Name: Vision-CAIR/VLV-Benchmark
Creator: Vision-CAIR
Published: 2024-06-10 16:03:58
License: 暂无描述

Hugging Face2024-06-10 更新2024-06-22 收录

下载链接：

https://hf-mirror.com/datasets/Vision-CAIR/VLV-Benchmark

下载链接

链接失效反馈

官方服务：

资源简介：

# VLV-Bench: A Comprehensive benchmark for very long-form videos understanding ![VLV-Bench teaser figure](repo_imags/VLV_Bench.JPG) # How to download videos 1- TVQA videos <br> 2- MovieNet Data # Annotation files You can find the annotation files for the 9 skills here ![annotation_pipeline](repo_imags/annotation_pipeline.jpg) # How to create the Benchmark ## Data scrapping 1- We scrapped the all the TVQA summaries from IMDB. <br> 2- We scrapped the all the MovieNet summaries from IMDB. <br> 3- We scrapped the transcripts for all the TVQA videos. <br> If you're using VLV-Bench in your research or applications, please cite using this BibTeX: ```  ``` ## Acknowledgements [Video-ChatGPT](https://mbzuai-oryx.github.io/Video-ChatGPT) ## License This repository is under [BSD 3-Clause License](LICENSE.md).

提供机构：

Vision-CAIR

原始信息汇总

VLV-Bench: 非常长视频理解的综合基准

数据集下载

TVQA视频
MovieNet数据

标注文件

提供了9个技能的标注文件。

基准创建方法

数据抓取

从IMDB抓取所有TVQA的摘要。
从IMDB抓取所有MovieNet的摘要。
抓取所有TVQA视频的转录文本。

引用

如果您在研究或应用中使用VLV-Bench，请使用以下BibTeX引用：

@article{ataallah2024minigpt4, title={MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens}, author={Ataallah, Kirolos and Shen, Xiaoqian and Abdelrahman, Eslam and Sleiman, Essam and Zhu, Deyao and Ding, Jian and Elhoseiny, Mohamed}, journal={arXiv preprint arXiv:2404.03413}, year={2024} }

搜集汇总

数据集介绍

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集