10Kh-RealOmin-OpenData

Name: 10Kh-RealOmin-OpenData
Creator: maas
Published: 2026-05-22 17:22:36
License: 暂无描述

魔搭社区2026-05-22 更新2026-01-10 收录

下载链接：

https://modelscope.cn/datasets/GenRobot.AI/10Kh-RealOmin-OpenData

下载链接

链接失效反馈

官方服务：

资源简介：

Boasting over 10,000 hours of cumulative data and 1 million+ clips, it ranks as the largest open-source embodied intelligence dataset in the industry. # Compared with other datasets, it has the following advantages: 1. Ample Data Volume & Strong Generalization Each skill is supported by sufficient data, collected from over 3,000 households and nearly 10,000 distinct fine-grained targets. It avoids simple repetitions and ensures robust generalization. 2. Authentic Scenarios & Focused Skills Captured from natural operations in real households, we avoid skill fragmentation that compromises quality. Instead, we focus on 10 key household scenarios and 30 core skills. 3. Bimanual & Long-duration Tasks Full recordings of the entire process of complex household chores and cleaning. Data collection by GenDAS Gripper. 4. Multi-modal & High-quality Data Includes large-FOV raw images, trajectories, annotations and joint movements. Trajectory reconstruction ensures industry-leading precision and quality. # Dataset Statistics | Attribute | Value | |-------------------------|----------------------| | Median Clip Length | 210.0 seconds | | Storage Size | 95 TB | | Format | mcap | | Resolution | 1600*1296 | | Frame Rate | 30 fps | | Camera Type | Large FOV Fisheye Camera | | IMU | Yes 6-axis | | Tactile Array Spatial | Yes | | Array Spatial Resolution | 1 mm | | Device |Gen DAS Gripper | # Stage 1 Content： **We have uploaded the data of Stage 1. This is only a small fraction and we will complete updates for the remaining skills as soon as possible.** Stage 1 covers 12 skills across 4 major scenario tasks .Total duration: 950 hours, clips: 39,761, storage 3.45TB. | Task | Skill | |-------------------------------|--------------------------------| | Folding_Clothes_and_Zipper_Operations | fold_and_store_clothes | | | zip_clothes | | Cooking_and_Kitchen_Clean | clean_container | | | unscrew_bottle_cap_and_pour | | | clean_bowl | | Organize_Clutter | desktop_object_sorting | | | fold_towel | | | fold_and_store_shopping_bag | | | drawer_to_take_items | | | drawer_to_place_items | | Shoes_Handling | lace_up_shoes_with_both_hands | | | organize_scattered_shoes | And synchronize the progress across major social platforms.In addition to the data, we will also provide relevant support including format conversion and usage guidance，here is the link GitHub - genrobot-ai/das-datakit. # Contact Us Any questions, suggestions or desired data collection scenarios/skills are welcome during usage. Let’s co-build this project to digitize all human skills. X：https://x.com/GenrobotAI Linkin：https://www.linkedin.com/company/108767412/admin/dashboard/ Email：opendata@genrobot.ai # Dataset Structure ``` The mcap files are stored in the final leaf folders of the file directory structure.Note: Each mcap file represents one piece of task data. ``` # Data Format Dual-arm tasks: robot0 and robot1 represent the left and right grippers respectively. Each gripper contains the following topics: ``` python /robot0/sensor/camera0/compressed # Fisheye camera image data — compressed and encoded in H.264 format. /robot0/sensor/camera0/camera_info # Fisheye Intrinsic and Extrinsic Parameters /robot0/sensor/imu # Inertial Measurement Unit (IMU) Data /robot0/sensor/magnetic_encoder # Magnetic encoder data: gripper opening distance /robot0/vio/eef_pose # Trajectory data ``` Topics are serialized using Protobuf for persistent storage /robot0/sensor/camera0/compressed: ``` protobuf // A compressed image message CompressedImage { // Timestamp of image google.protobuf.Timestamp timestamp = 1; // frame id string frame_id = 4; // Compressed image data, h264 video stream bytes data = 2; // Image format // Supported values: `webp`, `jpeg`, `png`, `h264` string format = 3; // common header, timestamp is inside it Header header = 8; } message Header { string module_name = 1; uint32 sequence_num = 2; uint64 timestamp = 3; string topic_name = 4; double expect_hz = 5; repeated Input inputs = 6; } ``` /robot0/sensor/camera0/camera_info: ``` protobuf // Camera calibration parameters message CameraCalibration { // not used google.protobuf.Timestamp timestamp = 1; // frame id string frame_id = 9; // Image width fixed32 width = 2; // Image height fixed32 height = 3; // Name of distortion model string distortion_model = 4; // Distortion parameters repeated double D = 5; // Intrinsic camera matrix (3x3 row-major matrix) // // A 3x3 row-major matrix for the raw (distorted) image. // // Projects 3D points in the camera coordinate frame to 2D pixel coordinates using the focal lengths (fx, fy) and principal point (cx, cy). // // ``` // [fx 0 cx] // K = [ 0 fy cy] // [ 0 0 1] // ``` repeated double K = 6; // length 9 // Rectification matrix (stereo cameras only, 3x3 row-major matrix) // // A rotation matrix aligning the camera coordinate system to the ideal stereo image plane so that epipolar lines in both stereo images are parallel. repeated double R = 7; // length 9 // Projection/camera matrix (stereo cameras only, 3x4 row-major matrix) // [fx' 0 cx' Tx] // P = [ 0 fy' cy' Ty] // [ 0 0 1 0] repeated double P = 8; // length 12 // transform from camera to base frame repeated double T_b_c = 10; // length 7, [tx ty tz qx qy qz qw] // common header Header header = 11; } ``` /robot0/sensor/imu: ``` protobuf // IMU message message IMUMeasurement { // common header arnold.common.proto.Header header = 1; // frame id string frame_id = 2; foxglove.Vector3 angular_velocity = 3; // Acceleration data in g-force units foxglove.Vector3 linear_acceleration = 4; // float temperature = 5; // repeated float angular_velocity_covariance = 6; // repeated float linear_acceleration_covariance = 7; } ``` /robot0/sensor/magnetic_encoder: ``` protobuf message MagneticEncoderMeasurement { // common header arnold.common.proto.Header header = 1; // frame id string frame_id = 2; // Distance between gripper fingers, 0-0.103m, 0 means closed double value = 3; } ``` /robot0/vio/eef_pose: ``` protobuf // A timestamped pose for an object or reference frame in 3D space message PoseInFrame { // not used google.protobuf.Timestamp timestamp = 1; // Frame id string frame_id = 2; // Pose in 3D space foxglove.Pose pose = 3; // linear vel foxglove.Vector3 linear_vel= 4; // angular_vel foxglove.Vector3 angular_vel = 5; // common header arnold.common.proto.Header header = 6; } ``` # How to Vis Data web view tool； ``` https://monitor.genrobot.click/#/index ``` # How to Load Data reference： ``` GitHub - genrobot-ai/das-datakit

本数据集累计数据时长超10000小时，包含百万余条片段，是业内规模最大的开源具身智能（embodied intelligence）数据集。 # 相较于其他同类数据集，本数据集具备以下优势： 1. 数据体量充足，泛化能力强劲每项技能均配有足量数据，采集自3000余户家庭与近10000个差异化细粒度目标，避免了简单重复，确保模型具备优秀的泛化性能。 2. 场景真实，技能聚焦数据采集自真实家庭中的自然操作场景，规避了破坏数据质量的技能碎片化问题，本次数据集聚焦10大核心家居场景与30项关键技能。 3. 支持双手操作与长时任务完整记录复杂家居劳作与清洁流程的全过程，数据由GenDAS机械手（GenDAS Gripper）采集。 4. 多模态且高质量包含大视场原始图像、轨迹、标注与关节运动数据。通过轨迹重构技术，实现了行业领先的精度与数据质量。 # 数据集统计信息 | 属性 | 数值 | |-------------------------|----------------------| | 片段中位时长 | 210.0 秒 | | 存储容量 | 95 TB | | 存储格式 | mcap | | 分辨率 | 1600*1296 | | 帧率 | 30 fps | | 相机类型 | 大视场鱼眼相机（Large FOV Fisheye Camera） | | 惯性测量单元（IMU） | 支持，6轴 | | 触觉阵列传感器 | 支持 | | 触觉阵列空间分辨率 | 1 毫米 | | 采集设备 | Gen DAS机械手（Gen DAS Gripper） | # 第一阶段内容 **我们已上传第一阶段数据集。本阶段数据仅为全部数据的一小部分，我们将尽快完成剩余技能相关数据的更新工作。** 第一阶段涵盖4大核心场景任务下的12项技能，总时长950小时，包含39761条数据片段，存储占用3.45 TB。 | 任务分类 | 技能名称 | |-------------------------------|--------------------------------| | 衣物折叠与拉链操作 | 折叠并收纳衣物 | | | 衣物拉链操作 | | 烹饪与厨房清洁 | 清洁容器 | | | 拧开瓶盖并倾倒 | | | 清洁碗具 | | 杂物整理 | 桌面物品整理 | | | 折叠毛巾 | | | 折叠并收纳购物袋 | | | 从抽屉取出物品 | | | 将物品放入抽屉 | | 鞋具处理 | 双手系鞋带 | | | 整理散落的鞋具 | 我们将同步在各大社交平台更新项目进展。除数据集外，我们还将提供格式转换、使用指南等相关支持，相关链接：GitHub - genrobot-ai/das-datakit。 # 联系我们使用过程中如有任何问题、建议或希望采集的场景/技能需求，欢迎随时与我们联系。让我们携手共建该项目，推动全人类技能的数字化转型。 X平台：https://x.com/GenrobotAI 领英（LinkedIn）：https://www.linkedin.com/company/108767412/admin/dashboard/ 电子邮箱：opendata@genrobot.ai # 数据集结构 mcap格式文件存储于文件目录结构的最底层叶文件夹中。注意：每个mcap文件对应一项任务数据。 # 数据格式双臂任务场景下：robot0与robot1分别对应左、右机械手。每个机械手包含以下话题（Topic）： python /robot0/sensor/camera0/compressed # 鱼眼相机图像数据——采用H.264格式压缩编码 /robot0/sensor/camera0/camera_info # 鱼眼相机内外参信息 /robot0/sensor/imu # 惯性测量单元（IMU）数据 /robot0/sensor/magnetic_encoder # 磁编码器数据：机械手开合距离 /robot0/vio/eef_pose # 末端执行器轨迹数据话题数据采用Protobuf进行序列化以实现持久化存储。 /robot0/sensor/camera0/compressed: protobuf // 压缩图像消息 message CompressedImage { // 图像时间戳 google.protobuf.Timestamp timestamp = 1; // 帧ID string frame_id = 4; // 压缩图像数据，H.264视频流 bytes data = 2; // 图像格式 // 支持格式：`webp`、`jpeg`、`png`、`h264` string format = 3; // 通用消息头，内部包含时间戳 Header header = 8; } message Header { string module_name = 1; uint32 sequence_num = 2; uint64 timestamp = 3; string topic_name = 4; double expect_hz = 5; repeated Input inputs = 6; } /robot0/sensor/camera0/camera_info: protobuf // 相机标定参数消息 message CameraCalibration { // 未使用字段 google.protobuf.Timestamp timestamp = 1; // 帧ID string frame_id = 9; // 图像宽度 fixed32 width = 2; // 图像高度 fixed32 height = 3; // 畸变模型名称 string distortion_model = 4; // 畸变参数 repeated double D = 5; // 相机内参矩阵（3x3行优先矩阵） // 将相机坐标系下的3D点投影至2D像素坐标，使用焦距(fx, fy)与主点(cx, cy) // // [fx 0 cx] // K = [ 0 fy cy] // [ 0 0 1] // repeated double K = 6; // 长度为9 // 校正矩阵（仅立体相机使用，3x3行优先矩阵） // 用于将相机坐标系旋转至理想立体成像平面，使两幅立体图像的极线平行 repeated double R = 7; // 长度为9 // 投影/相机矩阵（仅立体相机使用，3x4行优先矩阵） // [fx' 0 cx' Tx] // P = [ 0 fy' cy' Ty] // [ 0 0 1 0] repeated double P = 8; // 长度为12 // 相机到基坐标系的变换矩阵 repeated double T_b_c = 10; // 长度为7，格式为[tx ty tz qx qy qz qw] // 通用消息头 Header header = 11; } /robot0/sensor/imu: protobuf // IMU测量消息 message IMUMeasurement { // 通用消息头 arnold.common.proto.Header header = 1; // 帧ID string frame_id = 2; foxglove.Vector3 angular_velocity = 3; // 以g为单位的加速度数据 foxglove.Vector3 linear_acceleration = 4; // float temperature = 5; // repeated float angular_velocity_covariance = 6; // repeated float linear_acceleration_covariance = 7; } /robot0/sensor/magnetic_encoder: protobuf message MagneticEncoderMeasurement { // 通用消息头 arnold.common.proto.Header header = 1; // 帧ID string frame_id = 2; // 机械手手指开合距离，范围0-0.103m，0表示完全闭合 double value = 3; } /robot0/vio/eef_pose: protobuf // 三维空间中物体或参考帧的带时间戳位姿消息 message PoseInFrame { // 未使用字段 google.protobuf.Timestamp timestamp = 1; // 帧ID string frame_id = 2; // 三维空间位姿 foxglove.Pose pose = 3; // 线速度 foxglove.Vector3 linear_vel= 4; // 角速度 foxglove.Vector3 angular_vel = 5; // 通用消息头 arnold.common.proto.Header header = 6; } # 数据可视化方法可视化工具：https://monitor.genrobot.click/#/index # 数据加载方法参考实现：GitHub - genrobot-ai/das-datakit

提供机构：

maas

创建时间：

2026-01-04

搜集汇总

数据集介绍