five

WanyueZhang/World2VLM

收藏
Hugging Face2026-04-30 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/WanyueZhang/World2VLM
下载链接
链接失效反馈
官方服务:
资源简介:
World2VLM数据集旨在通过世界模型作为教师模型,将空间想象力蒸馏到视觉语言模型(VLMs)中,从而增强其动态空间推理能力。该数据集使VLMs能够在推理时无需外部模拟的情况下,对未来的视图和动作结果进行推理。演示数据集包含基于轨迹的监督样本和8种动态空间推理任务类型的示例。数据集结构包括四个子集,涵盖不同的教师模型和场景类型。每个子集包含一个轨迹束和一个带有结构化监督的tasks_demo.jsonl文件。数据格式和任务套件详细说明了数据构造流程和数据集的关键特征。

The World2VLM dataset is designed to enhance Vision-Language Models (VLMs) with dynamic spatial reasoning capabilities by distilling spatial imagination into VLMs using world models as teachers. This enables VLMs to reason about future views and action consequences without external simulation at inference time. The demo dataset includes trajectory-based supervision samples and examples of 8 dynamic spatial reasoning task types. The dataset structure comprises four subsets covering different teacher models and scene types. Each subset includes a trajectory bundle and a tasks_demo.jsonl file with structured supervision. The data format and task suite are detailed, along with the data construction pipeline and key features of the dataset.
提供机构:
WanyueZhang
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作