VincentNi/robotwin-blocks-ranking-rgb-rollouts

Name: VincentNi/robotwin-blocks-ranking-rgb-rollouts
Creator: VincentNi
Published: 2026-04-20 19:43:33
License: 暂无描述

Hugging Face2026-04-20 更新2026-04-26 收录

下载链接：

https://hf-mirror.com/datasets/VincentNi/robotwin-blocks-ranking-rgb-rollouts

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 language: - en tags: - robotics - video-generation - robotwin - wan22 - ti2v - reward-model size_categories: - n<1K --- # RoboTwin blocks_ranking_rgb — Wan2.2 TI2V Rollouts 160 text+image-to-video rollouts (10 initial conditions × 16 random seeds) for the **blocks_ranking_rgb** task from RoboTwin, generated with the **Wan2.2 TI2V (5B)** diffusion model fine-tuned with a merged Vidar LoRA adapter, and scored with the `blocks_ranking_v2` reward (SAM3 object tracking + IDM inverse-dynamics + FK gripper ↔ block position matching). Companion to the [EmbodiedVideoRL / DanceGRPO](https://arxiv.org/abs/2505.07818) reward-model work. ## Dataset summary | | value | |--------|-------| | Task | `blocks_ranking_rgb` (RoboTwin) — arrange R/G/B blocks left-to-right | | Scenes | 10 initial conditions | | Rollouts / scene | 16 (seeds 42–57) | | Total videos | 160 | | Frame count | 121 @ 16 fps | | Resolution | 640 × 736 | | Generator | Wan2.2-TI2V-5B + merged Vidar LoRA | | Sampler | 20-step SDE (eta=1.0, CFG=5.0, shift=5.0) | | Reward | `blocks_ranking_v2` — FK pick/place + left-to-right order + no-duplication | **Overall pass rate: 108 / 160 = 67.5 %** Per-scene: | scene id | pass / total | |----------|--------------| | 123500000 | 16 / 16 | | 123500001 | 4 / 16 | | 123500002 | 9 / 16 | | 123500003 | 12 / 16 | | 123500004 | 8 / 16 | | 123500005 | 15 / 16 | | 123500006 | 3 / 16 | | 123500007 | 12 / 16 | | 123500008 | 16 / 16 | | 123500009 | 13 / 16 | ## Files For each rollout with stem `{scene}_g{GGG}_s{SEED}`: | file | content | |------|---------| | `{stem}.mp4` | raw generated video (121 frames, 640×736, libx264) | | `{stem}_{CLEAN,FAIL}_ms{motion_score}.mp4` | annotated reward-debug video: SAM3 boxes per color, per-frame object count, FK gripper trails (left=cyan, right=magenta), pick/place match overlay, judge verdict | | `{stem}_{CLEAN,FAIL}.txt` | one-line reward summary with failure reasons | | `{stem}_extraction.json` | SAM3 tracking evidence — per-frame `obj_ids`, `probs`, `boxes_xywh` (xywh normalized 0–1), `num_tracked` for each of {red, green, blue} block prompts; plus `motion_score`, `max_jump`, `grip_changes`, `total_frames`, `fps`. Per-frame pixel masks have been stripped to keep the dataset compact — they are deterministically reproducible from SAM3 if needed. | | `{stem}_idm_actions.npy` | IDM-inferred 14-DOF action trajectory `(T, 14)` — layout `[left_arm(6), left_gripper(1), right_arm(6), right_gripper(1)]` | | `summary.json` | merged per-video summary (reward, criteria pass/fail, motion_score, max_jump, grip_changes, timings) | ## Reward criteria (`blocks_ranking_v2`) A rollout is labelled `CLEAN` (reward=1.0) iff all of: 1. **grip_changes == 6** — exactly 3 close + 3 open cycles detected by IDM (one per block) 2. **close-match OK** — every gripper-close FK position matches an *initial* block position (pick, thr=0.05) 3. **open-match OK** — every gripper-open FK position matches a *final* block position (place, thr=0.08) 4. **order OK** — final block order R < G < B along cx (left-to-right, margin=0.05) 5. **no duplication** — block count per color never exceeds 1 throughout the trajectory Otherwise `FAIL` (reward=0.0) and `_FAIL.txt` lists the triggered reasons. ## Loading ```python import json, numpy as np from huggingface_hub import hf_hub_download summary = json.load(open(hf_hub_download( "VincentNi/robotwin-blocks-ranking-rgb-rollouts", "summary.json", repo_type="dataset", ))) row = summary[0] stem = row["video"][:-4] # drop .mp4 ext = json.load(open(hf_hub_download( "VincentNi/robotwin-blocks-ranking-rgb-rollouts", f"{stem}_extraction.json", repo_type="dataset", ))) idm = np.load(hf_hub_download( "VincentNi/robotwin-blocks-ranking-rgb-rollouts", f"{stem}_idm_actions.npy", repo_type="dataset", )) ``` ## Generation command ```bash bash scripts/inference/rollout_score_blocks_ranking_rgb.sh # equivalently: 8 × H100, torchrun --nproc_per_node=8, # --num_rollouts 16 --seed 42 --eta 1.0 --sample_steps 20 --sample_guide_scale 5.0 # --reward_config blocks_ranking_v2 ```

提供机构：

VincentNi

5,000+

优质数据集

54 个

任务类型

进入经典数据集