V2C (Video-to-Commonsense)

Name: V2C (Video-to-Commonsense)
Creator: OpenDataLab
Published: 2026-05-24 08:30:13
License: 暂无描述

OpenDataLab2026-05-24 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/V2C

下载链接

链接失效反馈

官方服务：

资源简介：

我们展示了直接从视频中生成常识字幕的第一项工作，以描述意图、效果和属性等潜在方面。我们提出了一个新的数据集“视频到常识（V2C）”，其中包含约 9k 个人类代理执行各种动作的视频，并用 3 种常识描述进行注释。

This work presents the first study to generate commonsense captions directly from videos, which describe latent aspects such as intentions, effects and attributes. We introduce a novel dataset named Video-to-Commonsense (V2C), which contains approximately 9k videos of human agents performing various actions, and all these videos are annotated with three types of commonsense descriptions.

提供机构：

OpenDataLab

创建时间：

2022-06-07

搜集汇总

数据集介绍