ZS-F-VQA

Name: ZS-F-VQA
Creator: OpenDataLab
Published: 2026-05-24 10:30:18
License: 暂无描述

OpenDataLab2026-05-24 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/ZS-F-VQA

下载链接

链接失效反馈

官方服务：

资源简介：

ZS-F-VQA 数据集是针对零样本问题的 F-VQA 数据集的新拆分。首先，我们获得 F-VQA 数据集的原始训练/测试分割，并将它们组合在一起，根据其出现频率过滤出答案出现在前 500 名的三元组。接下来，我们以 1:1 的比例将这组答案随机分成新的训练拆分（也称为可见）$\mathcal{A}_s$ 和测试拆分（又名未见过）$\mathcal{A}_u$。参考 F-VQA 标准数据集，划分过程重复 5 次。对于原始 F-VQA 数据集中的每个 $(i,q,a)$ 三元组，如果 $a \in \mathcal{A}_s$ 则将其划分为训练集。否则它被分成测试集。 F-VQA 中训练和测试集的答案实例重叠为 2565 美元，而 ZS-F-VQA 中为 0 美元。

The ZS-F-VQA dataset is a novel split of the F-VQA dataset designed for zero-shot problems. First, we retrieve the original train and test splits of the F-VQA dataset, merge them into a unified corpus, and then filter out triples whose answers rank among the top 500 in terms of occurrence frequency. Next, we randomly split this set of candidate answers into a new training split (also termed visible) $mathcal{A}_s$ and a test split (also referred to as unseen) $mathcal{A}_u$ at a 1:1 ratio. Following the standard protocol of the F-VQA dataset, we repeat this splitting process 5 times. For each $(i,q,a)$ triple in the original F-VQA dataset, if the answer $a in mathcal{A}_s$, we assign it to the training set; otherwise, it is allocated to the test set. The number of overlapping answer instances between the training and test splits of the original F-VQA dataset is 2565, while that of the ZS-F-VQA dataset is 0.

提供机构：

OpenDataLab

创建时间：

2022-06-28

搜集汇总

数据集介绍