robomme_1cuben_allcases

Name: robomme_1cuben_allcases
Creator: alfayoung
Published: 2026-07-01 22:12:27
License: 暂无描述

Hugging Face2026-07-01 更新2026-07-01 收录

下载链接：

https://huggingface.co/datasets/alfayoung/robomme_1cuben_allcases

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集名为robomme_1cuben_allcases (VideoUnmaskSwap1CubeN)，专注于机器人视觉任务，场景涉及一个单色立方体被隐藏在三个杯子中的一个下方。杯子会以可变次数（0到3次，均匀分布）被打乱，机器人必须根据视频观察选择当前隐藏立方体的杯子。任务提示为无颜色的文本：仔细观看视频，然后拿起隐藏立方体的容器。采样设计包括120个独特的离散案例（基于3个初始位置和交换序列长度0到3），每个案例从对象+杯子初始姿势、对象颜色（红、绿、蓝）和恢复模式（无、z、xy）中抽取3次，共生成360个片段（总计112978帧）。恢复模式的权重为[0.7, 0.15, 0.15]，约30%的片段包含故意的抓取失败和恢复过程。所有片段都经过深度验证，确保每个案例可准确重现。数据集采用标准LeRobot v2.1布局，使用panda机器人，帧率为10 fps，未包含预计算的VAE潜在表示。

The dataset named robomme_1cuben_allcases (VideoUnmaskSwap1CubeN) focuses on a robot vision task where a monochrome cube is hidden under one of three cups. The cups are shuffled a variable number of times (uniformly distributed from 0 to 3 swaps), and the robot must select the cup currently hiding the cube based on video observation. The task prompt is colorless: Watch the video carefully, then pick up the container hiding the cube. The sampling design includes 120 unique discrete cases (3 initial positions × swap sequence lengths from 0 to 3), with each case sampled 3 times from (object + cup initial pose [seed] × object color {red, green, blue} × recovery mode {none, z, xy}), resulting in 360 clips (112978 frames total). Recovery modes are weighted as [0.7, 0.15, 0.15], with approximately 30% of clips containing intentional grasp failures and recovery processes. All clips are deeply validated to ensure accurate reproducibility for each case. The dataset follows the standard LeRobot v2.1 layout, using a panda robot at 10 fps, and does not include pre-computed VAE latent representations.

提供机构：

alfayoung

创建时间：

2026-06-30

原始信息汇总