adymaharana/cococon

Name: adymaharana/cococon
Creator: adymaharana
Published: 2023-04-10 16:25:13
License: 暂无描述

Hugging Face2023-04-10 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/adymaharana/cococon

下载链接

链接失效反馈

官方服务：

资源简介：

--- annotations_creators: - crowdsourced language: - en language_creators: - found license: - cc-by-4.0 multilinguality: - monolingual pretty_name: CoCoCON size_categories: - 1K<n<10K tags: - consistency - visual-reasoning task_ids: [] --- # Dataset Card for CoCoCON - [Dataset Description](#dataset-description) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Fields](#data-fields) - [Data Splits](#data-splits) - [Dataset Creation](#dataset-creation) - [Considerations for Using the Data](#considerations-for-using-the-data) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) ## Dataset Description CocoCON is a challenging dataset for evaluating cross-task consistency in vision-and-language models. We use contrast sets created by modifying COCO test instances for multiple tasks in small but semantically meaningful ways to change the gold label, and outline metrics for measuring if a model is consistent by ranking the original and perturbed instances across tasks. We find that state-of-the-art systems suffer from a surprisingly high degree of inconsistent behavior across tasks, especially for more heterogeneous tasks. - **Homepage:** https://adymaharana.github.io/cococon/ - **Repository:** https://github.com/adymaharana/cococon - **Paper:** https://arxiv.org/abs/2303.16133 - **Point of Contact:** adyasha@cs.unc.edu; ### Languages English. ## Dataset Structure Each sample in this dataset corresponds to a COCO image, a set of ground truth annotations for the image captioning, visual question-answering (VQA), and localization (optional) tasks, and their respective contrast sets. ### Data Fields caption (string): ground truth caption. query (string): VQA question. answer (string): ground truth VQA answer. question_id (int64): unordered unique identifier for sample. image_id (int64): COCO image id. detection (string): (optional) localization query. boxes (list): (optional) list of ground truth bounding boxes for the localization query. contrast_sets: Each sample in "contrast_sets" is a set of perturbed annotations corresponding to the ground truth annotations. Perturbed annotations are prefixed with "mutex_". file_name (string): COCO filename for the image. coco_url (string): url for downloading the image from the COCO server. flickr_url (string): url for downloading the image from Flickr. height (int64): height of image. width (int64): width of image. id (int64): ordered unique identifier for sample. ### Data Splits The CocoCON benchmark is an evaluation-only dataset. The data accessible through this link should be considered as the test split. ## Dataset Creation The CoCoCON dataset is created by a combination of machine + expert annotators who perturbed ground truth COCO annotations to create contrast sets. ## Considerations for Using the Data ### Licensing Information CC-By 4.0 ### Citation Information @article{maharana2023cococon, author = {Maharana, Adyasha and Kamath, Amita and Clark, Christopher and Bansal, Mohit and Kembhavi, Aniruddha}, title = {Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models.}, journal = {arxiv}, year = {2023}, }

提供机构：

adymaharana

原始信息汇总

数据集概述

名称： CoCoCON

语言： 英语

创建方式： 机器 + 专家注释者

许可证： CC-By 4.0

数据集大小： 1K<n<10K

标签：

一致性
视觉推理

数据集描述

CoCoCON是一个用于评估视觉与语言模型跨任务一致性的挑战性数据集。该数据集通过修改COCO测试实例创建对比集，以小但语义上有意义的方式改变黄金标签，并定义了衡量模型是否在跨任务中保持一致的排名指标。

数据集结构

数据字段：

caption (字符串): 地面实况标题。
query (字符串): VQA问题。
answer (字符串): 地面实况VQA答案。
question_id (int64): 样本的无序唯一标识符。
image_id (int64): COCO图像ID。
detection (字符串): (可选) 本地化查询。
boxes (列表): (可选) 本地化查询的地面实况边界框列表。
contrast_sets: 包含与地面实况注释对应的扰动注释集。
file_name (字符串): COCO图像文件名。
coco_url (字符串): 从COCO服务器下载图像的URL。
flickr_url (字符串): 从Flickr下载图像的URL。
height (int64): 图像高度。
width (int64): 图像宽度。
id (int64): 样本的有序唯一标识符。

数据分割：

CocoCON基准是一个仅用于评估的数据集，通过此链接访问的数据应视为测试分割。

数据集创建

CoCoCON数据集由机器和专家注释者共同创建，通过扰动COCO地面实况注释来生成对比集。

使用数据注意事项

许可证信息： CC-By 4.0

引用信息：

@article{maharana2023cococon, author = {Maharana, Adyasha and Kamath, Amita and Clark, Christopher and Bansal, Mohit and Kembhavi, Aniruddha}, title = {Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models.}, journal = {arxiv}, year = {2023}, }

5,000+

优质数据集

54 个

任务类型

进入经典数据集