rajeshramana/leisaac-pick-orange-prepared

Name: rajeshramana/leisaac-pick-orange-prepared
Creator: rajeshramana
Published: 2026-04-08 06:22:07
License: 暂无描述

Hugging Face2026-04-08 更新2026-04-12 收录

下载链接：

https://hf-mirror.com/datasets/rajeshramana/leisaac-pick-orange-prepared

下载链接

链接失效反馈

官方服务：

资源简介：

--- language: - en license: apache-2.0 tags: - robotics - isaac-lab - manipulation - lerobot - so101 - imitation-learning task_categories: - robotics size_categories: - 10K<n<100K --- # LeIsaac PickOrange — Prepared Dataset (GR00T-ready) Pre-processed version of [LightwheelAI/leisaac-pick-orange](https://huggingface.co/datasets/LightwheelAI/leisaac-pick-orange) ready for GR00T N1.6 fine-tuning. ## What's Different from the Original? The original dataset requires several modifications to work with GR00T N1.6. This version has all fixes pre-applied: | Fix | Original | This Dataset | |-----|----------|-------------| | Video codec | AV1 | **H.264** (decord-compatible) | | Annotation column | Missing | **Integer task_index** added to all parquet files | | modality.json | Missing | **Included** (state/action/video/annotation) | | Format | LeRobot v2.1 | LeRobot v2.1 (compatible) | ## Dataset Details - **Task**: LeIsaac-SO101-PickOrange-v0 (kitchen scene, pick 3 oranges → plate) - **Robot**: SO-101 follower (5 arm joints + 1 gripper) - **Episodes**: 60 teleoperation demonstrations - **Frames**: ~36,000 total - **FPS**: 30 - **Cameras**: Front (480x640) + Wrist (480x640) - **Actions**: 6D absolute joint positions in degrees ### Action Space ``` Index | Joint | Range (deg) ------|----------------|------------- 0 | shoulder_pan | [-38, 52] 1 | shoulder_lift | [-100, 64] 2 | elbow_flex | [-99, 99] 3 | wrist_flex | [22, 100] 4 | wrist_roll | [-14, 51] 5 | gripper | [1, 91] ``` ### modality.json ```json { "state": { "single_arm": {"start": 0, "end": 5}, "gripper": {"start": 5, "end": 6} }, "action": { "single_arm": {"start": 0, "end": 5}, "gripper": {"start": 5, "end": 6} }, "video": { "front": {"original_key": "observation.images.front"}, "wrist": {"original_key": "observation.images.wrist"} }, "annotation": { "human.task_description": {"original_key": "annotation.human.task_description"} } } ``` ## Usage with GR00T ```bash # 1. Clone GR00T git clone https://github.com/NVIDIA/Isaac-GR00T.git # 2. Download this dataset huggingface-cli download rajeshramana/leisaac-pick-orange-prepared \ --local-dir ./demo_data/pick_orange --repo-type dataset # 3. Fine-tune python gr00t/experiment/launch_finetune.py \ --base-model-path nvidia/GR00T-N1.6-3B \ --dataset-path ./demo_data/pick_orange \ --modality-config-path ./so101_pick_orange_config.py \ --embodiment-tag NEW_EMBODIMENT \ --num-gpus 1 \ --max-steps 10000 \ --no-tune-diffusion-model ``` ## Trained Model A model fine-tuned on this dataset for 10K steps (loss 0.017) is available at: [rajeshramana/groot-n1.6-pick-orange](https://huggingface.co/rajeshramana/groot-n1.6-pick-orange) ## Pre-processing Steps Applied 1. **Re-encoded all 120 videos** from AV1 → H.264: ```bash ffmpeg -y -i input.mp4 -c:v libx264 -crf 23 -preset fast -pix_fmt yuv420p output.mp4 ``` 2. **Added annotation column** to all 60 episode parquet files: ```python df["annotation.human.task_description"] = 0 # integer task_index ``` 3. **Created modality.json** mapping flat columns to GR00T's expected modality structure ## Source - **Original dataset**: [LightwheelAI/leisaac-pick-orange](https://huggingface.co/datasets/LightwheelAI/leisaac-pick-orange) - **Framework**: [LightwheelAI/leisaac](https://github.com/LightwheelAI/leisaac) - **Simulation**: NVIDIA Isaac Lab 2.3.2 ## Citation ```bibtex @misc{leisaac-pick-orange-prepared, title={LeIsaac PickOrange Prepared Dataset for GR00T}, author={Rajesh Kumar}, year={2026}, url={https://huggingface.co/datasets/rajeshramana/leisaac-pick-orange-prepared} } ```

语言： - en 许可证：apache-2.0 标签： - 机器人学 - Isaac Lab - 操作任务 - LeRobot - SO101 - 模仿学习（imitation learning）任务类别： - 机器人学规模类别： - 10K<n<100K # LeIsaac PickOrange — 预适配GR00T的预处理数据集本数据集为[LightwheelAI/leisaac-pick-orange](https://huggingface.co/datasets/LightwheelAI/leisaac-pick-orange)的预处理版本，已适配GR00T N1.6微调任务。 ## 与原始数据集的差异原始数据集需经过多项修改才能适配GR00T N1.6，本版本已预先完成所有修复： | 修复项 | 原始数据集 | 本数据集 | |-----|----------|-------------| | 视频编码格式 | AV1 | **H.264**（兼容decord） | | 标注列 | 缺失 | 所有Parquet文件均已添加整型`task_index`（任务索引）列 | | 模态配置文件（modality.json） | 缺失 | 已包含（包含状态、动作、视频、标注四类模态） | | 格式 | LeRobot v2.1 | 兼容LeRobot v2.1格式 | ## 数据集详情 - **任务**：LeIsaac-SO101-PickOrange-v0（厨房场景，拾取3个橙子至餐盘） - **机器人**：SO-101从动机械臂（5个臂关节+1个夹爪） - **演示片段**：60次遥操作演示 - **总帧数**：约36,000 - **帧率**：30 FPS - **相机配置**：前方相机（分辨率480×640）+ 腕部相机（分辨率480×640） - **动作空间**：6维绝对关节角度（单位：度） ### 动作空间索引 | 关节名称 | 角度范围（度） ------|----------------|------------- 0 | shoulder_pan（肩部平转关节） | [-38, 52] 1 | shoulder_lift（肩部抬升关节） | [-100, 64] 2 | elbow_flex（肘部屈伸关节） | [-99, 99] 3 | wrist_flex（腕部屈伸关节） | [22, 100] 4 | wrist_roll（腕部旋转关节） | [-14, 51] 5 | gripper（夹爪） | [1, 91] ### 模态配置文件（modality.json） json { "state": { "single_arm": {"start": 0, "end": 5}, "gripper": {"start": 5, "end": 6} }, "action": { "single_arm": {"start": 0, "end": 5}, "gripper": {"start": 5, "end": 6} }, "video": { "front": {"original_key": "observation.images.front"}, "wrist": {"original_key": "observation.images.wrist"} }, "annotation": { "human.task_description": {"original_key": "annotation.human.task_description"} } } ## GR00T使用方法 bash # 1. 克隆GR00T仓库 git clone https://github.com/NVIDIA/Isaac-GR00T.git # 2. 下载本数据集 huggingface-cli download rajeshramana/leisaac-pick-orange-prepared --local-dir ./demo_data/pick_orange --repo-type dataset # 3. 启动微调任务 python gr00t/experiment/launch_finetune.py --base-model-path nvidia/GR00T-N1.6-3B --dataset-path ./demo_data/pick_orange --modality-config-path ./so101_pick_orange_config.py --embodiment-tag NEW_EMBODIMENT --num-gpus 1 --max-steps 10000 --no-tune-diffusion-model ## 已训练模型本数据集经10,000步微调（损失值为0.017）的模型已上传至：[rajeshramana/groot-n1.6-pick-orange](https://huggingface.co/rajeshramana/groot-n1.6-pick-orange) ## 预处理步骤 1. **重编码所有120段视频**：将原AV1编码转换为H.264格式，使用命令： bash ffmpeg -y -i input.mp4 -c:v libx264 -crf 23 -preset fast -pix_fmt yuv420p output.mp4 2. **为全部60个演示片段的Parquet文件添加标注列**： python df["annotation.human.task_description"] = 0 # 整型任务索引task_index 3. **生成模态配置文件modality.json**：将扁平化的数据列映射至GR00T要求的模态结构。 ## 数据集来源 - 原始数据集：[LightwheelAI/leisaac-pick-orange](https://huggingface.co/datasets/LightwheelAI/leisaac-pick-orange) - 开发框架：[LightwheelAI/leisaac](https://github.com/LightwheelAI/leisaac) - 仿真环境：NVIDIA Isaac Lab 2.3.2 ## 引用格式 bibtex @misc{leisaac-pick-orange-prepared, title={LeIsaac PickOrange Prepared Dataset for GR00T}, author={Rajesh Kumar}, year={2026}, url={https://huggingface.co/datasets/rajeshramana/leisaac-pick-orange-prepared} }

提供机构：

rajeshramana

5,000+

优质数据集

54 个

任务类型

进入经典数据集