five

TESS-Computer/minecraft-vla-stage1

收藏
Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/TESS-Computer/minecraft-vla-stage1
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含来自Minecraft游戏的帧-动作对,专为遵循Lumine方法论的VLA模型训练而设计。数据集基于OpenAI VPT承包商数据的7.x子集,包含约17,886个视频(约330小时的早期游戏内容),聚焦于新世界创建后的前30分钟游戏内容。每个样本包含640x360 JPEG图像帧、视频ID、帧索引和Lumine格式的动作字符串。动作字符串包含鼠标移动、滚动(始终为0)和按键组合信息。数据集经过处理,帧率从20FPS降采样至5FPS,移除了空闲帧和加载画面。这是三阶段训练流程的第一阶段,专注于动作预训练(学习观察→动作映射)。

This dataset contains frame-action pairs from Minecraft gameplay, designed for training VLA models following the Lumine methodology. Processed from OpenAIs VPT contractor dataset (7.x subset), it includes ~17,886 videos (~330 hours of early-game gameplay) focusing on the first 30 minutes of new worlds. Each sample contains a 640x360 JPEG frame, video identifier, frame index, and Lumine-format action string. The action string includes mouse movements, scroll (always 0), and key combinations. The data is processed to 5 FPS (downsampled from VPTs 20 FPS) with idle frames and loading screens filtered out. This is Stage 1 of a 3-stage training pipeline focused on action pretraining (learning observation→action mapping).
提供机构:
TESS-Computer
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作