VALUE Video-and-Language Understanding Evaluation Dataset

Source: 超神经 (Hyper.AI). Updated 2022-10-24. Indexed 2024-05-15.

Download link:

https://hyper.ai/cn/datasets/18983


Resource overview:

VALUE stands for Video-And-Language Understanding Evaluation. It is a benchmark collection of 11 VidL (video-and-language) datasets covering three common tasks: text-to-video retrieval, video question answering, and video captioning. VALUE aims to cover a wide range of video genres, video lengths, data volumes, and task difficulty levels. Rather than focusing only on single-channel video with visual information alone, VALUE promotes models that use both video frames and their associated subtitles, as well as models that share knowledge across multiple tasks.

Created:

2022-10-24

Collected summary

Dataset introduction

Background and challenges

Background overview

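VALUE (Video-And-Language Understanding Evaluation) is a video-and-language understanding evaluation dataset that brings together 11 VidL datasets and supports three core tasks: text-to-video retrieval, video question answering, and video captioning. It aims to cover diverse video genres, lengths, and difficulty levels, and to encourage the development of multimodal models that use both video frames and subtitle information.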

The content above was collected and summarized by 遇见数据集.