so101_pick_place

Name: so101_pick_place
Creator: aiki253
Published: 2026-05-12 21:44:17
License: 暂无描述

Hugging Face2026-05-12 更新2026-05-12 收录

下载链接：

https://huggingface.co/datasets/aiki253/so101_pick_place

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是使用LeRobot工具创建的，专注于机器人学领域，具体涉及拾取和放置任务。数据集包含26个完整的情节（episodes），总计17845帧数据，帧率为每秒30帧。数据以Parquet文件格式存储，总数据文件大小为100MB，视频文件大小为200MB。数据集的结构包括多个特征：动作（action）包含6个浮点数值，对应机器人的关节位置（如肩部平移、肩部升降、肘部弯曲、腕部弯曲、腕部旋转和夹爪位置）；观察状态（observation.state）同样包含6个浮点数值，表示机器人的当前状态；观察图像（observation.images.front）为视频数据，分辨率为480x640像素，3个颜色通道，使用AV1编解码器，无音频。此外，还包括时间戳、帧索引、情节索引、索引和任务索引等元数据。数据集仅用于训练，涵盖一个任务类型，机器人类型为so_follower。该数据集适用于机器人控制、视觉导航和强化学习等研究。

This dataset is created using the LeRobot tool, focusing on the field of robotics, specifically involving pick-and-place tasks. It contains 26 complete episodes, totaling 17,845 frames of data at a frame rate of 30 frames per second. The data is stored in Parquet file format, with a total data file size of 100MB and a video file size of 200MB. The dataset structure includes multiple features: action (6 floating-point values corresponding to robot joint positions, such as shoulder translation, shoulder elevation, elbow flexion, wrist flexion, wrist rotation, and gripper position); observation.state (also 6 floating-point values representing the robots current state); observation.images.front (video data with a resolution of 480x640 pixels, 3 color channels, using the AV1 codec, and no audio). Additionally, it includes metadata such as timestamps, frame indices, episode indices, indices, and task indices. The dataset is intended for training only, covering a single task type, with the robot type being so_follower. It is suitable for research in robot control, visual navigation, and reinforcement learning.

提供机构：

aiki253

创建时间：

2026-05-12

原始信息汇总

数据集概述：so101_pick_place

基本信息

许可证：Apache-2.0
任务类别：机器人学（Robotics）
机器人类型：so_follower
创建工具：LeRobot

数据集规模

总轮次（Episodes）：26
总帧数：17,845
任务数量：1
帧率（FPS）：30
数据文件大小：约100 MB
视频文件大小：约200 MB
块大小（Chunks size）：1,000

数据集划分

训练集（Train）：0至25轮次（共26轮次）

数据结构与特征

动作（action）

数据类型：float32
形状：[6]
字段含义：肩关节、肘关节、腕关节和夹爪的位置信息，包含以下6个维度：
- shoulder_pan.pos
- shoulder_lift.pos
- elbow_flex.pos
- wrist_flex.pos
- wrist_roll.pos
- gripper.pos

观察-状态（observation.state）

数据类型：float32
形状：[6]
字段含义：与动作（action）相同，包含6个关节位置信息。

观察-图像（observation.images.front）

数据类型：视频
形状：[480, 640, 3]
维度含义：高度（480）、宽度（640）、通道数（3）
视频参数：
- 编码格式：av1
- 像素格式：yuv420p
- 是否为深度图：否
- 帧率：30 FPS
- 是否有音频：否

其他特征

timestamp：时间戳，float32，形状[1]
frame_index：帧索引，int64，形状[1]
episode_index：轮次索引，int64，形状[1]
index：全局索引，int64，形状[1]
task_index：任务索引，int64，形状[1]

数据集路径

数据文件路径：data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet
视频文件路径：videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4

5,000+

优质数据集

54 个

任务类型

进入经典数据集