VCR (Visual Commonsense Reasoning)

Name: VCR (Visual Commonsense Reasoning)
Creator: OpenDataLab
Published: 2026-05-24 11:30:42
License: 暂无描述

OpenDataLab2026-05-24 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/VCR_Visual_Commonsense_Reasoning

下载链接

链接失效反馈

官方服务：

资源简介：

VCR的全称是视觉常识推理，是视觉常识推理的大规模数据集。这个数据集提出了关于图像的具有挑战性的问题，机器需要完成两个子任务: 正确回答问题和提供理由证明其答案。 VCR数据集包含大量问题，212K用于训练，26K用于验证，25K用于测试。答案和理由来自超过110K个独特的电影场景。

VCR, short for Visual Commonsense Reasoning, is a large-scale dataset for visual commonsense reasoning. This dataset poses challenging questions related to images and requires machines to complete two subtasks: correctly answering the questions and providing justifications to support their answers. The VCR dataset contains a large quantity of questions, with 212K allocated for training, 26K for validation, and 25K for testing. Both the answers and justifications are sourced from over 110K unique movie scenes.

提供机构：

OpenDataLab

创建时间：

2023-04-20

搜集汇总

数据集介绍