princeton-vl/vlmtunnel

Name: princeton-vl/vlmtunnel
Creator: princeton-vl
Published: 2025-12-15 14:59:49
License: 暂无描述

Hugging Face2025-12-15 更新2025-12-20 收录

下载链接：

https://hf-mirror.com/datasets/princeton-vl/vlmtunnel

下载链接

链接失效反馈

官方服务：

资源简介：

VLM Tunnel数据集是一个用于评估视觉语言模型（VLMs）在非局部视觉推理方面能力的基准数据集。它包含三个主要任务：电路追踪（Circuits）、视觉寻宝（Visual Scavenger Hunt）和物体重识别（Object Re-Identification）。数据集通过GitHub上的代码生成，主要用于研究和评估视觉推理。每个任务都有详细的描述和数据格式说明，包括任务类型、变体、提示、答案和图像等字段。数据集的设计旨在测试VLMs在复杂视觉场景中的推理能力。

The VLM Tunnel Dataset is a benchmark used to evaluate the nonlocal visual reasoning capabilities of Vision-Language Models (VLMs). It includes three main tasks: Circuits (wire tracing), Visual Scavenger Hunt (chain following), and Object Re-Identification (pair matching). The dataset is generated using code from GitHub and is intended for research and evaluation of visual reasoning. Each task is described in detail, with data format specifications including task type, variant, prompt, answer, and images. The dataset is designed to test VLMs reasoning abilities in complex visual scenarios.

提供机构：

princeton-vl

5,000+

优质数据集

54 个

任务类型

进入经典数据集