AI2_Alphabot_2_arrange_teaset

Name: AI2_Alphabot_2_arrange_teaset
Creator: RoboCOIN
Published: 2026-06-01 20:44:40
License: 暂无描述

Hugging Face2026-06-01 更新2026-06-01 收录

下载链接：

https://huggingface.co/datasets/RoboCOIN/AI2_Alphabot_2_arrange_teaset

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集名为AI2_Alphabot_2_arrange_teaset，是一个机器人操作数据集，专注于整理茶具任务：从桌子上拿起玻璃杯并将其放置到托盘上。数据集基于LeRobot格式扩展，完全兼容LeRobot框架，包含399个episodes，总计318,301帧，数据量约为11.51 GB，帧率为30 FPS。使用的机器人是AI2_Alphabot_2，配备双指末端执行器，通过VR控制器遥操作。场景设置为餐饮->餐厅环境，涉及对象包括透明玻璃杯和棕色托盘。数据集提供四个RGB摄像头的视频数据：胸前前置摄像头、头部前置摄像头、左手腕摄像头和右手腕摄像头，分辨率均为640x480，使用AV1编解码器。此外，包含机器人的状态数据（如关节位置、末端执行器位姿）和动作数据，维度均为34维，并提供丰富的注释信息，如末端执行器加速度幅度、方向、速度以及夹爪活动和模式等。数据集主要用于机器人操作学习研究。

The dataset is named AI2_Alphabot_2_arrange_teaset, which is a robotic manipulation dataset focused on a specific tea set arrangement task: picking up a glass from a table and placing it on a tray. It is extended based on the LeRobot format and fully compatible with the LeRobot framework. The dataset contains 399 episodes, totaling 318,301 frames, with a data volume of approximately 11.51 GB and a frame rate of 30 FPS. The robot used is the AI2_Alphabot_2, equipped with a two-finger end effector and teleoperated via a VR controller. The scene is set in a dining->restaurant environment, involving objects such as a transparent glass and a brown tray. The dataset provides video data from four RGB cameras: a chest front camera, a head front camera, a left wrist camera, and a right wrist camera, all with a resolution of 640x480 and using the AV1 codec. Additionally, it includes robot state data (e.g., joint positions, end-effector poses) and action data, both with a dimension of 34, and offers rich annotations such as end-effector acceleration magnitude, orientation, velocity, as well as gripper activity and mode. The dataset is primarily used for robotic manipulation learning research.

提供机构：

RoboCOIN

创建时间：

2026-06-01

原始信息汇总

数据集概述

数据集名称: AI2_Alphabot_2_arrange_teaset
数据集主页: https://flagopen.github.io/RoboCOIN/
论文: https://arxiv.org/abs/2511.17441
仓库: https://github.com/FlagOpen/RoboCOIN
许可证: Apache-2.0

任务描述

主要任务: 从桌子上拿起玻璃杯，并将其放在托盘上。
操作类型: 固定单臂（fixed_single_arm）
环境类型: 真实世界（real_world）
原子动作: 拾取（pick）、放置（place）
子任务数量: 1 个

场景与物体

场景类型: 餐饮 -> 餐厅（Catering->Restaurant）
物体: 玻璃杯（透明）、托盘（棕色）

机器人配置

机器人名称: AI2_Alphabot_2
末端执行器类型: 两指末端执行器（two_finger_end_effector）
遥操作类型: VR控制器（vr_controller）
代码库版本: v2.1

传感器与相机

传感器: cam_front_chest_rgb, cam_front_head_rgb, cam_left_wrist_rgb, cam_right_wrist_rgb
相机信息:
- 所有相机分辨率均为 640x480，编码为 av1，像素格式为 yuv420p，FPS 为 30。

数据集统计

指标	值
总片段数	399
总帧数	318,301
总任务数	1
总视频数	1,596
总分块数	1
分块大小	1000
FPS	30
状态维度	34
动作维度	34
相机视角数	4
数据集大小	11.51 GB

数据划分

数据集仅包含一个训练集：片段 0 至 398。

数据集结构

数据集遵循 LeRobot 格式，包含以下组件：

数据文件: 包含状态、动作、元数据等，路径模式为 data/chunk-{id}/episode_{id}.parquet。
视频文件: 压缩的 RGB 相机观测视频，路径模式为 videos/chunk-{id}/observation.images.*/episode_{id}.mp{id}。
注释文件: 包含丰富的注释，如 eef_acc_mag_annotation.jsonl, eef_direction_annotation.jsonl, eef_velocity_annotation.jsonl, gripper_activity_annotation.jsonl, gripper_mode_annotation.jsonl。

特征（完整 YAML）

特征包括：

观测图像: 4 个 RGB 相机视图，每个形状为 (480, 640, 3)。
观测状态: 34 维浮点数组，包含左右臂关节位置、末端执行器位置与旋转、夹爪状态、颈部关节、躯干关节信息。
动作: 34 维浮点数组，结构与观测状态对称。
时间戳: 单精度浮点数。
帧索引: 64 位整数。
片段索引: 64 位整数。
索引: 64 位整数。
任务索引: 64 位整数。
夹爪开合比例: 2 维浮点数组（左右）。
末端执行器模拟位姿: 12 维浮点数组（左右）。
方向、速度、加速度、夹爪模式、夹爪活动状态: 均为 2 维整数数组（左右）。

贡献者

数据集由北京人工智能研究院（BAAI）的 RoboCOIN 团队贡献。

5,000+

优质数据集

54 个

任务类型

进入经典数据集