gSCAN

Name: gSCAN
Creator: Laura Ruis
License: No description available

Indexed from arXiv on 2025-09-30

Download link:

https://github.com/LauraRuis/multimodal_seq2seq_gSCAN

Description:

gSCAN is a benchmark for evaluating compositional generalization in situated language understanding. It tests how an agent interprets and executes commands in a grid-world environment. The dataset provides training and test splits, with tasks that require different forms of language generalization, including zero-shot generalization to novel color-shape combinations. It contains 19,282 test examples and is intended to measure systematic generalization in grounded language understanding.
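To make the idea of zero-shot generalization to a novel color-shape combination concrete, the toy sketch below holds out one pairing from training while keeping each color and each shape visible on its own. It is an illustration only: the vocabulary, the field names, the command template, and the held-out pair are all assumptions, and none of it reflects the actual gSCAN file format or split definitions.

```python
from itertools import product

# Toy vocabulary, assumed for illustration (not taken from gSCAN).
COLORS = ["red", "green", "blue"]
SHAPES = ["circle", "square", "cylinder"]

# Hypothetical held-out pairing: never shown together during training.
HELD_OUT = {("red", "square")}


def make_examples():
    """Enumerate one toy command per color-shape pair."""
    return [
        {"command": f"walk to the {color} {shape}", "color": color, "shape": shape}
        for color, shape in product(COLORS, SHAPES)
    ]


def split_compositional(examples, held_out):
    """Put held-out color-shape pairs in test and everything else in train."""
    train = [e for e in examples if (e["color"], e["shape"]) not in held_out]
    test = [e for e in examples if (e["color"], e["shape"]) in held_out]
    return train, test


train, test = split_compositional(make_examples(), HELD_OUT)
```

In this sketch the training set still contains "red" with other shapes and "square" with other colors, so a model that composes the two concepts systematically could in principle solve the test command without having seen that exact pairing.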

Provided by:

Laura Ruis
