ViLCo: VIdeo Language COntinual learning Benchmark

NIAID Data Ecosystem2026-05-02 收录

下载链接：

https://zenodo.org/record/11560094

下载链接

链接失效反馈

官方服务：

资源简介：

We introduce the first VIdeo Language COntinual learning Benchmark (ViLCo-Bench). Video language continual learning involves continuously adapting to information from video and text inputs, enhancing a model’s ability to handle new tasks while retaining prior knowledge. This field is a relatively under-explored area, and establishing appropriate datasets is crucial for facilitating communication and research in this field. In this study, we present the first dedicated benchmark, ViLCo-Bench, designed to evaluate continual learning models across a range of video-text tasks. The dataset comprises ten-minute-long videos and corresponding language queries collected from publicly available datasets. Additionally, we introduce a novel memory-efficient framework that incorporates self-supervised learning and mimics long-term and short-term memory effects. This framework addresses challenges including memory complexity from long video clips, natural language complexity from open queries, and text-video misalignment. We posit that ViLCo-Bench, with greater complexity compared to existing continual learning benchmarks, would serve as a critical tool for exploring the video-language domain, extending beyond conventional class-incremental tasks, and addressing complex and limited annotation issues. More detailed information can also be found on our url: https://github.com/cruiseresearchgroup/ViLCo

创建时间：

2024-07-02

5,000+

优质数据集

54 个

任务类型

进入经典数据集