TVQA+

Name: TVQA+
Creator: University of North Carolina
License: 暂无描述

arXiv2025-09-30 收录

下载链接：

https://github.com/jayleicn/tvqaplus

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集名为TVQA+，是首个融合了空间和时间注释的视频问答数据集，它基于原始的TVQA数据集构建。TVQA+包含了2,527个类别的注释，并支持三种任务：问答、时间定位和空间定位。该数据集的规模包括来自4,198个视频片段的29,383对问答，以及148,468张图像上标注的310,826个边界框。其任务重点在于进行时空视频问答。

This dataset, named TVQA+, is the first video question answering (QA) dataset that integrates both spatial and temporal annotations, and it is constructed based on the original TVQA dataset. TVQA+ includes annotations for 2,527 categories and supports three tasks: question answering, temporal localization, and spatial localization. In terms of dataset scale, it contains 29,383 QA pairs derived from 4,198 video clips, along with 310,826 bounding boxes annotated on 148,468 images. The core focus of its tasks is spatio-temporal video question answering.

提供机构：

University of North Carolina

搜集汇总

数据集介绍

背景与挑战

背景概述

TVQA+是首个融合时空注释的视频问答数据集，基于原始TVQA构建，支持问答、时间定位和空间定位三种任务。该数据集包含来自4,198个视频片段的29,383对问答，并在148,468张图像上标注了310,826个边界框，旨在促进时空视频问答研究。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集