LongerVideos
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/HKUDS/VideoRAG
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个全面性的基准测试,包含了超过二十个视频集,分为三大类:讲座视频、纪录片视频和娱乐视频。它旨在评估针对长视频内容的检索增强生成框架。每个视频集总时长平均超过4小时,包含1到20多个独立视频,并且还包括从视频字幕生成的高质量查询。该数据集规模宏大,包含超过160个视频,生成了600多个多样化的查询,为视频为基础的问答任务提供了一个健壮的评估集合。
This dataset is a comprehensive benchmark consisting of over twenty video collections, categorized into three major groups: lecture videos, documentary videos, and entertainment videos. It is designed to evaluate retrieval-augmented generation frameworks for long-form video content. Each video collection has an average total duration of over 4 hours, contains 1 to more than 20 individual videos, and also includes high-quality queries generated from video subtitles. Boasting a substantial scale, this dataset comprises over 160 videos and over 600 diverse queries, providing a robust evaluation collection for video-based question answering tasks.
提供机构:
Authors of the paper



