OpenGVLab/CRPE

Name: OpenGVLab/CRPE
Creator: OpenGVLab
Published: 2024-03-21 14:42:11
License: 暂无描述

Hugging Face2024-03-21 更新2024-05-25 收录

下载链接：

https://hf-mirror.com/datasets/OpenGVLab/CRPE

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 --- # Circular-based Relation Probing Evaluation (CRPE) CRPE is a benchmark designed to quantitatively evaluate the object recognition and relation comprehension ability of models. The evaluation is formulated as single-choice questions. The benchmark consists of four splits: **Existence**, **Subject**, **Predicate**, and **Object**. The **Existence** split evaluates the object recognition ability while the remaining splits are designed to evaluate the capability of relation comprehension, focusing on probing each of the elements in the relation triplets `(subject, predicate, object)` separately. Some data examples are shown below. <img width="800" alt="image" src="https://cdn-uploads.huggingface.co/production/uploads/619507e7b74b6c591f794340/_NKaowl2OUBAjck1XCAPm.jpeg"> Additionally, to evaluate the dependency on language priors, we also include abnormal data in our evaluation. These images in these abnormal data depict relation triplets that are very rare in the real world. <img width="800" alt="image" src="https://cdn-uploads.huggingface.co/production/uploads/619507e7b74b6c591f794340/qKWw7Qb93OXClxI_VrCRk.jpeg"> For a robust evaluation, we adopt CircularEval as our evaluation strategy. Under this setting, a question is considered as correctly answered only when the model consistently predicts the correct answer in each of the N iterations, with N corresponding to the number of choices. In each iteration, a circular shift is applied to both the choices and the answer to form a new query for the model. CRPE contains the following files: - `crpe_exist.jsonl`: the evaluation data of **Existence** split. - `crpe_exist_meta.jsonl`: the evaluation data of **Existence** split without CircularEval. - `crpe_relation.jsonl`: the evaluation data of **Subject**, **Predicate**, and **Object** split. - `crpe_relation_meta.jsonl`: the evaluation data of **Subject**, **Predicate**, and **Object** split without CircularEval. **NOTE**: You should use `crpe_exist.jsonl` and `crpe_relation.jsonl` for evaluation. The evaluation script is presented [here](https://github.com/OpenGVLab/all-seeing/blob/main/all-seeing-v2/llava/eval/eval_crpe.py). See our [project](https://github.com/OpenGVLab/all-seeing/all-seeing-v2) to learn more details! # Citation If you find our work useful in your research, please consider cite: ```BibTeX @article{wang2023allseeing, title={The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World}, author={Wang, Weiyun and Shi, Min and Li, Qingyun and Wang, Wenhai and Huang, Zhenhang and Xing, Linjie and Chen, Zhe and Li, Hao and Zhu, Xizhou and Cao, Zhiguo and others}, journal={arXiv preprint arXiv:2308.01907}, year={2023} } @article{wang2024allseeing_v2, title={The All-Seeing Project V2: Towards General Relation Comprehension of the Open World}, author={Wang, Weiyun and Ren, Yiming and Luo, Haowen and Li, Tiantong and Yan, Chenxiang and Chen, Zhe and Wang, Wenhai and Li, Qingyun and Lu, Lewei and Zhu, Xizhou and others}, journal={arXiv preprint arXiv:2402.19474}, year={2024} } ```

--- license: apache-2.0 --- # 基于循环的关系探测评估基准（Circular-based Relation Probing Evaluation, CRPE） CRPE是一款专为定量评估模型的目标识别与关系理解能力设计的基准测试集，本次评估以单项选择题为形式构建。该基准测试集包含四个划分子集：**存在性（Existence）**、**主体（Subject）**、**谓词（Predicate）**与**客体（Object）**。其中**存在性**子集用于评估模型的目标识别能力，其余三个子集则聚焦于关系理解能力测评，分别对关系三元组`(subject, predicate, object)`中的各元素进行单独探测。部分数据示例如下： <img width="800" alt="image" src="https://cdn-uploads.huggingface.co/production/uploads/619507e7b74b6c591f794340/_NKaowl2OUBAjck1XCAPm.jpeg"> 此外，为评估模型对语言先验的依赖程度，本次测试还引入了异常数据样本。此类异常数据中的图像所描绘的关系三元组在现实世界中极为罕见。 <img width="800" alt="image" src="https://cdn-uploads.huggingface.co/production/uploads/619507e7b74b6c591f794340/qKWw7Qb93OXClxI_VrCRk.jpeg"> 为保障评估的稳健性，我们采用循环评估（CircularEval）作为评估策略。在该策略下，仅当模型在N轮迭代中均能始终如一地预测出正确答案时，才认为该问题回答正确，其中N等于选项的数量。每一轮迭代中，我们都会对选项与标准答案进行循环移位，为模型生成全新的查询任务。 CRPE包含以下文件： - `crpe_exist.jsonl`：**存在性**子集的评估数据 - `crpe_exist_meta.jsonl`：未采用循环评估策略的**存在性**子集评估数据 - `crpe_relation.jsonl`：**主体**、**谓词**与**客体**子集的评估数据 - `crpe_relation_meta.jsonl`：未采用循环评估策略的**主体**、**谓词**与**客体**子集评估数据 **注意**：评估时请使用`crpe_exist.jsonl`与`crpe_relation.jsonl`。评估脚本可参见[此处](https://github.com/OpenGVLab/all-seeing/blob/main/all-seeing-v2/llava/eval/eval_crpe.py)。如需了解更多细节，请参阅我们的[项目页面](https://github.com/OpenGVLab/all-seeing/all-seeing-v2)！ # 引用声明若您的研究中用到了本工作，请引用如下文献： BibTeX @article{wang2023allseeing, title={The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World}, author={Wang, Weiyun and Shi, Min and Li, Qingyun and Wang, Wenhai and Huang, Zhenhang and Xing, Linjie and Chen, Zhe and Li, Hao and Zhu, Xizhou and Cao, Zhiguo and others}, journal={arXiv preprint arXiv:2308.01907}, year={2023} } @article{wang2024allseeing_v2, title={The All-Seeing Project V2: Towards General Relation Comprehension of the Open World}, author={Wang, Weiyun and Ren, Yiming and Luo, Haowen and Li, Tiantong and Yan, Chenxiang and Chen, Zhe and Wang, Wenhai and Li, Qingyun and Lu, Lewei and Zhu, Xizhou and others}, journal={arXiv preprint arXiv:2402.19474}, year={2024} }

提供机构：

OpenGVLab

原始信息汇总

数据集概述

数据集名称

Circular-based Relation Probing Evaluation (CRPE)

数据集目的

用于量化评估模型的对象识别和关系理解能力。

数据集结构

四个分割：
- Existence：评估对象识别能力。
- Subject, Predicate, Object：评估关系理解能力，分别针对关系三元组中的元素进行探测。

数据集内容

文件列表：
- crpe_exist.jsonl：Existence分割的评估数据。
- crpe_exist_meta.jsonl：Existence分割的评估数据，不含CircularEval。
- crpe_relation.jsonl：Subject, Predicate, Object分割的评估数据。
- crpe_relation_meta.jsonl：Subject, Predicate, Object分割的评估数据，不含CircularEval。

评估方法

CircularEval：通过多次迭代和循环移位来评估模型的一致性。

异常数据

包含描绘现实中罕见关系三元组的图像，用于评估模型对语言先验的依赖。

使用建议

推荐使用crpe_exist.jsonl和crpe_relation.jsonl进行评估。

搜集汇总

数据集介绍

构建方式

CRPE数据集的构建基于CircularEval评估策略，旨在量化评估模型在对象识别与关系理解方面的能力。该数据集划分为四个子集：存在性（Existence）、主体（Subject）、谓词（Predicate）与对象（Object），分别针对对象识别及关系三元组（主体、谓词、对象）中的各个元素进行独立评估。数据集包含正常与异常的数据实例，后者用于评估模型对语言先验的依赖性。

使用方法

使用CRPE数据集进行评估时，研究者应使用`crpe_exist.jsonl`与`crpe_relation.jsonl`文件。评估脚本及相关使用方法可在指定的GitHub链接中找到。正确引用该数据集的相关研究论文对于研究成果的准确归功至关重要。

背景与挑战

背景概述

Circular-based Relation Probing Evaluation (CRPE) 数据集，由OpenGVLab团队开发，旨在定量评估模型在对象识别与关系理解方面的能力。该数据集创立于2023年，主要研究人员包括Weiyun Wang、Min Shi、Qingyun Li等，隶属于计算机视觉与自然语言处理领域。CRPE数据集的核心研究问题是提升模型对开放世界中对象及其关系的理解和识别能力。数据集采用CircularEval评估策略，对模型在对象存在性、主语、谓语和宾语四个方面的关系理解进行细致评价。CRPE的构建对相关领域产生了显著影响，推动了视觉识别与关系理解技术的发展。

当前挑战

CRPE数据集在构建和应用过程中面临的主要挑战包括：1) 如何精确地评估模型在对象识别与关系理解方面的能力，特别是在异常或罕见关系数据的处理上；2) 数据集的构建过程中，如何保证关系三元组（主语、谓语、宾语）的覆盖范围广泛且具有代表性；3) 采用CircularEval策略时，如何确保模型在不同迭代中保持一致性的正确预测。这些挑战对于提高数据集的质量和模型的评估准确性至关重要。

常用场景

经典使用场景

在人工智能领域，对象识别与关系理解是计算机视觉研究的关键技术。CRPE数据集作为一项评测基准，旨在量化评估模型在对象识别与关系理解方面的能力。其经典使用场景在于，通过单选题的形式，对模型在识别对象存在性以及理解关系三元组（主体、谓词、宾语）各元素的能力进行精确评测。

解决学术问题

CRPE数据集解决了学术研究中如何客观评估模型在视觉关系理解方面的性能问题，其通过四个不同的数据分割（存在性、主体、谓词、宾语）以及包含异常数据的语言依赖性评估，为研究者提供了一个全面且细致的评价框架，有助于推动视觉识别领域的研究进展。

实际应用

在实际应用中，CRPE数据集的评估策略被用于优化和提升计算机视觉模型的泛化能力，使其在真实世界的视觉场景中能够更加准确地识别对象并理解对象间的关系，从而提高智能系统的决策质量和效率。

数据集最近研究