Airbot_MMK2_pick_up_and_place_tub

Name: Airbot_MMK2_pick_up_and_place_tub
Creator: RoboCOIN
Published: 2026-03-20 22:23:12
License: 暂无描述

Hugging Face2026-03-20 更新2026-03-20 收录

下载链接：

https://huggingface.co/datasets/RoboCOIN/Airbot_MMK2_pick_up_and_place_tub

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集使用基于LeRobot的扩展格式，并完全兼容LeRobot。它包含98个总片段，16632帧，帧率为30FPS，数据集大小为607.45 MB。机器人名称为Airbot_MMK2，末端执行器类型为五指夹爪。数据集包含四个摄像头视图：cam_head_rgb、cam_left_wrist_rgb、cam_right_wrist_rgb和cam_front_rgb。场景为家庭->浴室，对象为未知的洗脸盆。主要任务指令是“拿起洗脸盆并放下”。数据集包含9个不同的子任务和多个原子动作。数据集结构包括视频文件、状态数据、动作数据和元数据，组织成1个块，每个块大小为1000。

提供机构：

RoboCOIN

创建时间：

2026-03-20

原始信息汇总

Airbot_MMK2_pick_up_and_place_tub 数据集概述

数据集基本信息

数据集名称：Airbot_MMK2_pick_up_and_place_tub
任务类别：机器人学
语言：英语
许可协议：Apache-2.0
格式兼容性：基于LeRobot的扩展格式，完全兼容LeRobot。

数据集规模与统计

总情节数：98
总帧数：16632
总任务数：9
总视频数：392
总数据块数：1
数据块大小：1000
帧率：30 FPS
状态维度：36
动作维度：36
摄像机视角数：4
数据集大小：607.45 MB

任务与场景

主要任务指令：pick up the washbasin and put it down.
标准化任务描述：pick up the washbasin and put it down.
场景类型：household->bathroom
操作对象：washbasin(unknown)

子任务列表

该数据集包含9个不同的子任务：

End (索引：0)
Lift the basin with the left gripper (索引：1)
Grasp the basin with the left gripper (索引：2)
Grasp the basin with the right gripper (索引：3)
Place basin on the table with the right gripper (索引：4)
Place basin on the table with the left gripper (索引：5)
Abnormal (索引：6)
Lift the basin with the right gripper (索引：7)
null (索引：8)

原子动作

grasp
pick
place

机器人硬件配置

机器人名称：Airbot_MMK2
代码库版本：v2.1
末端执行器类型：five_finger_gripper
遥操作类型：由于某些原因，此数据集暂时无法提供遥操作类型信息。

传感器与摄像机

传感器列表：cam_head_rgb， cam_left_wrist_rgb， cam_right_wrist_rgb， cam_front_rgb
摄像机信息：
- cam_head_rgb：dtype=video， shape=480x640x3， resolution=640x480， codec=av1， pix_fmt=yuv420p
- cam_left_wrist_rgb：dtype=video， shape=480x640x3， resolution=640x480， codec=av1， pix_fmt=yuv420p
- cam_right_wrist_rgb：dtype=video， shape=480x640x3， resolution=640x480， codec=av1， pix_fmt=yuv420p
- cam_front_rgb：dtype=video， shape=480x640x3， resolution=640x480， codec=av1， pix_fmt=yuv420p

坐标系与单位

坐标系定义：right-hand-frame
关节旋转单位：radian
末端执行器旋转维度：end_rotation_dim
末端执行器平移维度：end_translation_dim

数据划分

训练集：情节 0:97

数据集结构

文件组织模式

数据文件路径模式：data/chunk-{id}/episode_{id}.parquet
视频文件路径模式：videos/chunk-{id}/observation.images.cam_left_wrist_rgb/episode_{id}.mp{id}

目录树结构

可用标注

数据集包含丰富的标注以支持多样化的学习方法：

eef_acc_mag_annotation.jsonl
eef_direction_annotation.jsonl
eef_velocity_annotation.jsonl
gripper_activity_annotation.jsonl
gripper_mode_annotation.jsonl
scene_annotations.jsonl
subtask_annotations.jsonl

数据特征

数据集包含以下主要特征（基于提供的YAML描述摘要）：

观测图像：来自四个摄像头的视频数据，形状为480x640x3，编码为AV1。
观测状态：36维浮点向量，包含左右臂及左右手各关节的角度。
动作：36维浮点向量，结构与观测状态对应。
时间戳、帧索引、情节索引等：辅助索引信息。
末端执行器状态与动作：包含位置、旋转、方向、速度、加速度大小等信息。

贡献者与链接

贡献者：北京智源人工智能研究院的RoboCOIN团队。
主页：https://flagopen.github.io/RoboCOIN/
论文：https://arxiv.org/abs/2511.17441
代码仓库：https://github.com/FlagOpen/RoboCOIN

使用条款与引用

访问条款：访问此数据集即表示您同意在您的研究/出版物中引用相关论文（详见“引用”部分）。您同意不使用该数据集进行对人类受试者造成伤害的实验。
主要引用：请引用相关的arXiv预印本论文。
附加引用：如果使用此数据集，请同时考虑引用LeRobot框架：https://github.com/huggingface/lerobot。

5,000+

优质数据集

54 个

任务类型

进入经典数据集