EmbSpatial-Bench

arXiv2025-09-30 收录

下载链接：

https://arxiv.org/pdf/2406.05756

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是一个用于评估大型视觉-语言模型（LVLMs）在具身空间理解方面的基准，它源自于覆盖了从第一人称视角的具身场景中提取的6种空间关系。该任务旨在评估在具身体验环境中对空间理解的能力。

This dataset is a benchmark for evaluating Large Vision-Language Models (LVLMs) on embodied spatial understanding. It consists of six types of spatial relations extracted from embodied scenes captured from a first-person perspective. This task is designed to assess the capability of spatial comprehension in embodied experience environments.

5,000+

优质数据集

54 个

任务类型

进入经典数据集