
AgiBotWorld-Alpha | Robotics Dataset | Robot Manipulation Dataset

ModelScope Community. Updated 2025-04-25; indexed 2025-01-11.
Robotics
Robot manipulation
Download link:
https://modelscope.cn/datasets/agibot-world/AgiBotWorld-Alpha
Resource description:
# Key Features 🔑

- **1 million+** trajectories from 100 robots.
- **100+ real-world scenarios** across 5 target domains.
- **Cutting-edge hardware:** visual tactile sensors / 6-DoF dexterous hand / mobile dual-arm robots
- **Tasks involving:**
  - Contact-rich manipulation
  - Long-horizon planning
  - Multi-robot collaboration

# News 🌍

- **`[2025/1/20]`** AgiBot World Alpha released on OpenDataLab. [Download Link](https://opendatalab.com/OpenDataLab/AgiBot-World)
- **`[2025/1/3]`** AgiBot World Alpha [**sample dataset**](sample_dataset.tar) released.
- **`[2024/12/30]`** AgiBot World Alpha released.

# TODO List 📅

- [x] **AgiBot World Beta**: ~1,000,000 trajectories of high-quality robot data (expected release date: Q1 2025)
- [x] Complete language annotation of Alpha version (expected release date: Mid-January 2025)
- [ ] **AgiBot World Colosseum**: Comprehensive platform (expected release date: 2025)
- [ ] **2025 AgiBot World Challenge** (expected release date: 2025)

# Table of Contents

- [Key Features 🔑](#key-features-)
- [News 🌍](#news-)
- [TODO List 📅](#todo-list-)
- [Get started 🔥](#get-started-)
  - [Download the Dataset](#download-the-dataset)
  - [Dataset Structure](#dataset-structure)
  - [Explanation of Proprioceptive State](#explanation-of-proprioceptive-state)
  - [Dataset Preprocessing](#dataset-preprocessing)
- [License and Citation](#license-and-citation)

# Get started 🔥

## Download the Dataset

To download the full dataset, you can use the following code. If you encounter any issues, please refer to the official Hugging Face documentation.

```
# Make sure you have git-lfs installed (https://git-lfs.com)
git lfs install

# When prompted for a password, use an access token with write permissions.
# Generate one from your settings: https://huggingface.co/settings/tokens
git clone https://huggingface.co/datasets/agibot-world/AgiBotWorld-Alpha

# If you want to clone without large files - just their pointers
GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/datasets/agibot-world/AgiBotWorld-Alpha
```

If you only want to download a specific task, such as `task_327`, you can use the following code.

```
# Make sure you have git-lfs installed (https://git-lfs.com)
git lfs install

# Initialize an empty Git repository
git init AgiBotWorld-Alpha
cd AgiBotWorld-Alpha

# Set the remote repository
git remote add origin https://huggingface.co/datasets/agibot-world/AgiBotWorld-Alpha

# Enable sparse-checkout
git sparse-checkout init

# Specify the folders and files
git sparse-checkout set observations/327 task_info/task_327.json scripts proprio_stats parameters

# Pull the data
git pull origin main
```

To make it easier to inspect the dataset's internal structure and examples, we also provide a sample dataset of approximately 7 GB. Please refer to `sample_dataset.tar`.

## Dataset Preprocessing

Our project relies solely on the [lerobot library](https://github.com/huggingface/lerobot) (dataset `v2.0`). Please follow their [installation instructions](https://github.com/huggingface/lerobot?tab=readme-ov-file#installation).

Here, we provide a script for converting the dataset to the lerobot format. **Note** that you need to replace `/path/to/agibotworld/alpha` and `/path/to/save/lerobot` with the actual paths.

```
python scripts/convert_to_lerobot.py --src_path /path/to/agibotworld/alpha --task_id 352 --tgt_path /path/to/save/lerobot
```

We would like to express our gratitude to the developers of lerobot for their outstanding contributions to the open-source community.

## Dataset Structure

### Folder hierarchy

```
data
├── task_info
│   ├── task_327.json
│   ├── task_352.json
│   └── ...
├── observations
│   ├── 327 # This represents the task id.
│   │   ├── 648642 # This represents the episode id.
│   │   │   ├── depth # This is a folder containing depth information saved in PNG format.
│   │   │   └── videos # This is a folder containing videos from all camera perspectives.
│   │   ├── 648649
│   │   │   └── ...
│   │   └── ...
│   ├── 352
│   │   ├── 648544
│   │   │   ├── depth
│   │   │   └── videos
│   │   ├── 648564
│   │   │   └── ...
│   │   └── ...
│   └── ...
├── parameters
│   ├── 327
│   │   ├── 648642
│   │   │   └── camera
│   │   ├── 648649
│   │   │   └── camera
│   │   └── ...
│   ├── 352
│   │   ├── 648544
│   │   │   └── camera # This contains all the cameras' intrinsic and extrinsic parameters.
│   │   └── 648564
│   │       └── camera
│   └── ...
└── proprio_stats
    ├── 327[task_id]
    │   ├── 648642[episode_id]
    │   │   └── proprio_stats.h5 # This file contains all the robot's proprioceptive information.
    │   ├── 648649
    │   │   └── proprio_stats.h5
    │   └── ...
    ├── 352[task_id]
    │   ├── 648544[episode_id]
    │   │   └── proprio_stats.h5
    │   └── 648564
    │       └── proprio_stats.h5
    └── ...
```

### json file format

The `task_[id].json` file stores the basic information of every episode along with the language instructions. Several keys need further explanation:

- **action_config**: A list of all **action slices** from the episode. Each action slice includes a start and end frame, the corresponding atomic skill, and the language instruction.
- **key_frame**: Annotations for keyframes, including the start and end of each keyframe and a detailed description.

(`action_text` and `description` *are not available now, to be released by mid-January.*)

```
[
  {"episode_id": 649078,
   "task_id": 327,
   "task_name": "Picking items in Supermarket",
   "init_scene_text": "The robot is in front of the fruit shelf in the supermarket.",
   "lable_info": {
     "action_config": [
       {"start_frame": 0,
        "end_frame": 435,
        "action_text": "Pick up onion from the shelf.",
        "skill": "Pick"
       },
       {"start_frame": 435,
        "end_frame": 619,
        "action_text": "Place onion into the plastic bag in the shopping cart.",
        "skill": "Place"
       },
       ...
     ],
     "key_frame": [
       {"start": 0,
        "end": 435,
        "comment": "Failure recovery"
       }
     ]
   }
  },
  ...
]
```

### h5 file format

The `proprio_stats.h5` file stores all the robot's proprioceptive data. For more detailed information, please refer to the [explanation of proprioceptive state](#explanation-of-proprioceptive-state).

```
|-- timestamp
|-- state
    |-- effector
        |-- force
        |-- position
    |-- end
        |-- angular
        |-- orientation
        |-- position
        |-- velocity
        |-- wrench
    |-- head
        |-- effort
        |-- position
        |-- velocity
    |-- joint
        |-- current_value
        |-- effort
        |-- position
        |-- velocity
    |-- robot
        |-- orientation
        |-- orientation_drift
        |-- position
        |-- position_drift
    |-- waist
        |-- effort
        |-- position
        |-- velocity
|-- action
    |-- effector
        |-- force
        |-- index
        |-- position
    |-- end
        |-- orientation
        |-- position
    |-- head
        |-- effort
        |-- position
        |-- velocity
    |-- joint
        |-- effort
        |-- index
        |-- position
        |-- velocity
    |-- robot
        |-- index
        |-- orientation
        |-- position
        |-- velocity
    |-- waist
        |-- effort
        |-- position
        |-- velocity
```

## Explanation of Proprioceptive State

### Terminology

*The definitions and data ranges in this section may change with software and hardware versions. Stay tuned.*

**State and action**

1. State
   State refers to the monitoring information reported by the sensors and actuators.
2. Action
   Action refers to the instructions sent to the hardware abstraction layer, whose controller responds to them. The issued instructions can therefore differ from the actually executed state.

**Actuators**

1. ***Effector:*** refers to the end effector, for example dexterous hands or grippers.
2. ***End:*** refers to the 6DoF end pose on the robot flange.
3. ***Head:*** refers to the robot's head perspective, which has two degrees of freedom (pitch and yaw).
4. ***Joint:*** refers to the joints of the robot's dual arms, with 14 degrees of freedom (7 DoF each).
5. ***Robot:*** refers to the robot's pose in its surrounding environment. The orientation and position give the robot's relative pose in the odometry coordinate system, whose origin is set when the robot is powered on.
6. ***Waist:*** refers to the joints of the robot's waist, which has two degrees of freedom (pitch and lift).

### Common fields

1. Position: Spatial position, encoder position, angle, etc.
2. Velocity: Speed
3. Angular: Angular velocity
4. Effort: Torque of the motor. Not available for now.
5. Wrench: Six-dimensional force, that is, force in the xyz directions plus torque. Not available for now.

### Value shapes and ranges

| Group | Shape | Meaning |
| --- | :---- | :---- |
| /timestamp | [N] | timestamp in nanoseconds |
| /state/effector/position (gripper) | [N, 2] | left `[:, 0]`, right `[:, 1]`, gripper open range in mm |
| /state/effector/position (dexhand) | [N, 12] | left `[:, :6]`, right `[:, 6:]`, joint angle in rad |
| /state/end/orientation | [N, 2, 4] | left `[:, 0, :]`, right `[:, 1, :]`, flange quaternion with xyzw |
| /state/end/position | [N, 2, 3] | left `[:, 0, :]`, right `[:, 1, :]`, flange xyz in meters |
| /state/head/position | [N, 2] | yaw `[:, 0]`, pitch `[:, 1]`, rad |
| /state/joint/current_value | [N, 14] | left arm `[:, :7]`, right arm `[:, 7:]` |
| /state/joint/position | [N, 14] | left arm `[:, :7]`, right arm `[:, 7:]`, rad |
| /state/robot/orientation | [N, 4] | quaternion in xyzw, yaw only |
| /state/robot/position | [N, 3] | xyz position in meters, where z is always 0 |
| /state/waist/position | [N, 2] | pitch `[:, 0]` in rad, lift `[:, 1]` in meters |
| /action/*/index | [M] | action indexes refer to when the control source is actually sending signals |
| /action/effector/position (gripper) | [N, 2] | left `[:, 0]`, right `[:, 1]`, 0 for full open and 1 for full close |
| /action/effector/position (dexhand) | [N, 12] | same as /state/effector/position |
| /action/effector/index | [M_1] | index when the control source for end effector is sending control signals |
| /action/end/orientation | [N, 2, 4] | same as /state/end/orientation |
| /action/end/position | [N, 2, 3] | same as /state/end/position |
| /action/end/index | [M_2] | same as other indexes |
| /action/head/position | [N, 2] | same as /state/head/position |
| /action/head/index | [M_3] | same as other indexes |
| /action/joint/position | [N, 14] | same as /state/joint/position |
| /action/joint/index | [M_4] | same as other indexes |
| /action/robot/velocity | [N, 2] | vel along x axis `[:, 0]`, yaw rate `[:, 1]` |
| /action/robot/index | [M_5] | same as other indexes |
| /action/waist/position | [N, 2] | same as /state/waist/position |
| /action/waist/index | [M_6] | same as other indexes |

# License and Citation

All the data and code within this repo are under [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/). Please consider citing our project if it helps your research.

```BibTeX
@misc{contributors2024agibotworldrepo,
  title={AgiBot World Colosseum},
  author={AgiBot World Colosseum contributors},
  howpublished={\url{https://github.com/OpenDriveLab/AgiBot-World}},
  year={2024}
}
```
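
As a rough illustration of the slicing conventions in the "Value shapes and ranges" table of the README above, the following Python sketch splits a `/state/joint/position` array into left and right arms, converts `/timestamp` from nanoseconds to seconds, and turns an `/action/*/index` array into a per-frame mask. The helper names (`split_arm_joints`, `relative_seconds`, `active_mask`) are hypothetical and not part of the dataset's scripts. The arrays are synthetic stand-ins for data you would load from `proprio_stats.h5`, and reading `index` as a list of frame positions within the `N`-length arrays is an assumption rather than something the README states explicitly.

```python
import numpy as np


def split_arm_joints(joint_position):
    """Split an [N, 14] joint array into left [N, 7] and right [N, 7] arms."""
    joint_position = np.asarray(joint_position)
    if joint_position.ndim != 2 or joint_position.shape[1] != 14:
        raise ValueError("expected an array of shape [N, 14]")
    # Per the table: left arm is [:, :7], right arm is [:, 7:].
    return joint_position[:, :7], joint_position[:, 7:]


def relative_seconds(timestamp_ns):
    """Convert nanosecond timestamps to seconds elapsed since the first frame."""
    timestamp_ns = np.asarray(timestamp_ns, dtype=np.int64)
    return (timestamp_ns - timestamp_ns[0]) / 1e9


def active_mask(index, num_frames):
    """Return a boolean mask of length N that is True at the listed frames.

    ASSUMPTION: `index` holds frame positions into the N-length arrays,
    marking when the control source was actually sending signals.
    """
    mask = np.zeros(num_frames, dtype=bool)
    mask[np.asarray(index, dtype=np.int64)] = True
    return mask


# Synthetic stand-ins for /timestamp, /state/joint/position, /action/joint/index.
N = 5
timestamp = np.array([1_000_000_000 + i * 33_333_333 for i in range(N)], dtype=np.int64)
joint_position = np.arange(N * 14, dtype=np.float64).reshape(N, 14)
joint_index = np.array([1, 3])

left, right = split_arm_joints(joint_position)
t = relative_seconds(timestamp)
mask = active_mask(joint_index, N)

print(left.shape, right.shape)
print(t)
print(mask)
```

With real data, the same helpers would be applied to arrays read from the corresponding groups of `proprio_stats.h5`; the mask can then be used to keep only the frames where a given action channel was actively commanded.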
Provider:
maas
Created:
2024-12-30