five

10Kh-RealOmin-OpenData

收藏
魔搭社区2026-05-22 更新2026-01-10 收录
下载链接:
https://modelscope.cn/datasets/GenRobot.AI/10Kh-RealOmin-OpenData
下载链接
链接失效反馈
官方服务:
资源简介:
Boasting over 10,000 hours of cumulative data and 1 million+ clips, it ranks as the largest open-source embodied intelligence dataset in the industry. # Compared with other datasets, it has the following advantages: 1. Ample Data Volume & Strong Generalization Each skill is supported by sufficient data, collected from over 3,000 households and nearly 10,000 distinct fine-grained targets. It avoids simple repetitions and ensures robust generalization. 2. Authentic Scenarios & Focused Skills Captured from natural operations in real households, we avoid skill fragmentation that compromises quality. Instead, we focus on 10 key household scenarios and 30 core skills. 3. Bimanual & Long-duration Tasks Full recordings of the entire process of complex household chores and cleaning. Data collection by GenDAS Gripper. 4. Multi-modal & High-quality Data Includes large-FOV raw images, trajectories, annotations and joint movements. Trajectory reconstruction ensures industry-leading precision and quality. # Dataset Statistics | Attribute | Value | |-------------------------|----------------------| | Median Clip Length | 210.0 seconds | | Storage Size | 95 TB | | Format | mcap | | Resolution | 1600*1296 | | Frame Rate | 30 fps | | Camera Type | Large FOV Fisheye Camera | | IMU | Yes 6-axis | | Tactile Array Spatial | Yes | | Array Spatial Resolution | 1 mm | | Device |Gen DAS Gripper | # Stage 1 Content: **We have uploaded the data of Stage 1. This is only a small fraction and we will complete updates for the remaining skills as soon as possible.** Stage 1 covers 12 skills across 4 major scenario tasks .Total duration: 950 hours, clips: 39,761, storage 3.45TB. | Task | Skill | |-------------------------------|--------------------------------| | Folding_Clothes_and_Zipper_Operations | fold_and_store_clothes | | | zip_clothes | | Cooking_and_Kitchen_Clean | clean_container | | | unscrew_bottle_cap_and_pour | | | clean_bowl | | Organize_Clutter | desktop_object_sorting | | | fold_towel | | | fold_and_store_shopping_bag | | | drawer_to_take_items | | | drawer_to_place_items | | Shoes_Handling | lace_up_shoes_with_both_hands | | | organize_scattered_shoes | And synchronize the progress across major social platforms.In addition to the data, we will also provide relevant support including format conversion and usage guidance,here is the link GitHub - genrobot-ai/das-datakit. # Contact Us Any questions, suggestions or desired data collection scenarios/skills are welcome during usage. Let’s co-build this project to digitize all human skills. X:https://x.com/GenrobotAI Linkin:https://www.linkedin.com/company/108767412/admin/dashboard/ Email:opendata@genrobot.ai # Dataset Structure ``` The mcap files are stored in the final leaf folders of the file directory structure.Note: Each mcap file represents one piece of task data. ``` # Data Format Dual-arm tasks: robot0 and robot1 represent the left and right grippers respectively. Each gripper contains the following topics: ``` python /robot0/sensor/camera0/compressed # Fisheye camera image data — compressed and encoded in H.264 format. /robot0/sensor/camera0/camera_info # Fisheye Intrinsic and Extrinsic Parameters /robot0/sensor/imu # Inertial Measurement Unit (IMU) Data /robot0/sensor/magnetic_encoder # Magnetic encoder data: gripper opening distance /robot0/vio/eef_pose # Trajectory data ``` Topics are serialized using Protobuf for persistent storage /robot0/sensor/camera0/compressed: ``` protobuf // A compressed image message CompressedImage { // Timestamp of image google.protobuf.Timestamp timestamp = 1; // frame id string frame_id = 4; // Compressed image data, h264 video stream bytes data = 2; // Image format // Supported values: `webp`, `jpeg`, `png`, `h264` string format = 3; // common header, timestamp is inside it Header header = 8; } message Header { string module_name = 1; uint32 sequence_num = 2; uint64 timestamp = 3; string topic_name = 4; double expect_hz = 5; repeated Input inputs = 6; } ``` /robot0/sensor/camera0/camera_info: ``` protobuf // Camera calibration parameters message CameraCalibration { // not used google.protobuf.Timestamp timestamp = 1; // frame id string frame_id = 9; // Image width fixed32 width = 2; // Image height fixed32 height = 3; // Name of distortion model string distortion_model = 4; // Distortion parameters repeated double D = 5; // Intrinsic camera matrix (3x3 row-major matrix) // // A 3x3 row-major matrix for the raw (distorted) image. // // Projects 3D points in the camera coordinate frame to 2D pixel coordinates using the focal lengths (fx, fy) and principal point (cx, cy). // // ``` // [fx 0 cx] // K = [ 0 fy cy] // [ 0 0 1] // ``` repeated double K = 6; // length 9 // Rectification matrix (stereo cameras only, 3x3 row-major matrix) // // A rotation matrix aligning the camera coordinate system to the ideal stereo image plane so that epipolar lines in both stereo images are parallel. repeated double R = 7; // length 9 // Projection/camera matrix (stereo cameras only, 3x4 row-major matrix) // [fx' 0 cx' Tx] // P = [ 0 fy' cy' Ty] // [ 0 0 1 0] repeated double P = 8; // length 12 // transform from camera to base frame repeated double T_b_c = 10; // length 7, [tx ty tz qx qy qz qw] // common header Header header = 11; } ``` /robot0/sensor/imu: ``` protobuf // IMU message message IMUMeasurement { // common header arnold.common.proto.Header header = 1; // frame id string frame_id = 2; foxglove.Vector3 angular_velocity = 3; // Acceleration data in g-force units foxglove.Vector3 linear_acceleration = 4; // float temperature = 5; // repeated float angular_velocity_covariance = 6; // repeated float linear_acceleration_covariance = 7; } ``` /robot0/sensor/magnetic_encoder: ``` protobuf message MagneticEncoderMeasurement { // common header arnold.common.proto.Header header = 1; // frame id string frame_id = 2; // Distance between gripper fingers, 0-0.103m, 0 means closed double value = 3; } ``` /robot0/vio/eef_pose: ``` protobuf // A timestamped pose for an object or reference frame in 3D space message PoseInFrame { // not used google.protobuf.Timestamp timestamp = 1; // Frame id string frame_id = 2; // Pose in 3D space foxglove.Pose pose = 3; // linear vel foxglove.Vector3 linear_vel= 4; // angular_vel foxglove.Vector3 angular_vel = 5; // common header arnold.common.proto.Header header = 6; } ``` # How to Vis Data web view tool; ``` https://monitor.genrobot.click/#/index ``` # How to Load Data reference: ``` GitHub - genrobot-ai/das-datakit

本数据集累计数据时长超10000小时,包含百万余条片段,是业内规模最大的开源具身智能(embodied intelligence)数据集。 # 相较于其他同类数据集,本数据集具备以下优势: 1. 数据体量充足,泛化能力强劲 每项技能均配有足量数据,采集自3000余户家庭与近10000个差异化细粒度目标,避免了简单重复,确保模型具备优秀的泛化性能。 2. 场景真实,技能聚焦 数据采集自真实家庭中的自然操作场景,规避了破坏数据质量的技能碎片化问题,本次数据集聚焦10大核心家居场景与30项关键技能。 3. 支持双手操作与长时任务 完整记录复杂家居劳作与清洁流程的全过程,数据由GenDAS机械手(GenDAS Gripper)采集。 4. 多模态且高质量 包含大视场原始图像、轨迹、标注与关节运动数据。通过轨迹重构技术,实现了行业领先的精度与数据质量。 # 数据集统计信息 | 属性 | 数值 | |-------------------------|----------------------| | 片段中位时长 | 210.0 秒 | | 存储容量 | 95 TB | | 存储格式 | mcap | | 分辨率 | 1600*1296 | | 帧率 | 30 fps | | 相机类型 | 大视场鱼眼相机(Large FOV Fisheye Camera) | | 惯性测量单元(IMU) | 支持,6轴 | | 触觉阵列传感器 | 支持 | | 触觉阵列空间分辨率 | 1 毫米 | | 采集设备 | Gen DAS机械手(Gen DAS Gripper) | # 第一阶段内容 **我们已上传第一阶段数据集。本阶段数据仅为全部数据的一小部分,我们将尽快完成剩余技能相关数据的更新工作。** 第一阶段涵盖4大核心场景任务下的12项技能,总时长950小时,包含39761条数据片段,存储占用3.45 TB。 | 任务分类 | 技能名称 | |-------------------------------|--------------------------------| | 衣物折叠与拉链操作 | 折叠并收纳衣物 | | | 衣物拉链操作 | | 烹饪与厨房清洁 | 清洁容器 | | | 拧开瓶盖并倾倒 | | | 清洁碗具 | | 杂物整理 | 桌面物品整理 | | | 折叠毛巾 | | | 折叠并收纳购物袋 | | | 从抽屉取出物品 | | | 将物品放入抽屉 | | 鞋具处理 | 双手系鞋带 | | | 整理散落的鞋具 | 我们将同步在各大社交平台更新项目进展。除数据集外,我们还将提供格式转换、使用指南等相关支持,相关链接:GitHub - genrobot-ai/das-datakit。 # 联系我们 使用过程中如有任何问题、建议或希望采集的场景/技能需求,欢迎随时与我们联系。让我们携手共建该项目,推动全人类技能的数字化转型。 X平台:https://x.com/GenrobotAI 领英(LinkedIn):https://www.linkedin.com/company/108767412/admin/dashboard/ 电子邮箱:opendata@genrobot.ai # 数据集结构 mcap格式文件存储于文件目录结构的最底层叶文件夹中。注意:每个mcap文件对应一项任务数据。 # 数据格式 双臂任务场景下:robot0与robot1分别对应左、右机械手。每个机械手包含以下话题(Topic): python /robot0/sensor/camera0/compressed # 鱼眼相机图像数据——采用H.264格式压缩编码 /robot0/sensor/camera0/camera_info # 鱼眼相机内外参信息 /robot0/sensor/imu # 惯性测量单元(IMU)数据 /robot0/sensor/magnetic_encoder # 磁编码器数据:机械手开合距离 /robot0/vio/eef_pose # 末端执行器轨迹数据 话题数据采用Protobuf进行序列化以实现持久化存储。 /robot0/sensor/camera0/compressed: protobuf // 压缩图像消息 message CompressedImage { // 图像时间戳 google.protobuf.Timestamp timestamp = 1; // 帧ID string frame_id = 4; // 压缩图像数据,H.264视频流 bytes data = 2; // 图像格式 // 支持格式:`webp`、`jpeg`、`png`、`h264` string format = 3; // 通用消息头,内部包含时间戳 Header header = 8; } message Header { string module_name = 1; uint32 sequence_num = 2; uint64 timestamp = 3; string topic_name = 4; double expect_hz = 5; repeated Input inputs = 6; } /robot0/sensor/camera0/camera_info: protobuf // 相机标定参数消息 message CameraCalibration { // 未使用字段 google.protobuf.Timestamp timestamp = 1; // 帧ID string frame_id = 9; // 图像宽度 fixed32 width = 2; // 图像高度 fixed32 height = 3; // 畸变模型名称 string distortion_model = 4; // 畸变参数 repeated double D = 5; // 相机内参矩阵(3x3行优先矩阵) // 将相机坐标系下的3D点投影至2D像素坐标,使用焦距(fx, fy)与主点(cx, cy) // // [fx 0 cx] // K = [ 0 fy cy] // [ 0 0 1] // repeated double K = 6; // 长度为9 // 校正矩阵(仅立体相机使用,3x3行优先矩阵) // 用于将相机坐标系旋转至理想立体成像平面,使两幅立体图像的极线平行 repeated double R = 7; // 长度为9 // 投影/相机矩阵(仅立体相机使用,3x4行优先矩阵) // [fx' 0 cx' Tx] // P = [ 0 fy' cy' Ty] // [ 0 0 1 0] repeated double P = 8; // 长度为12 // 相机到基坐标系的变换矩阵 repeated double T_b_c = 10; // 长度为7,格式为[tx ty tz qx qy qz qw] // 通用消息头 Header header = 11; } /robot0/sensor/imu: protobuf // IMU测量消息 message IMUMeasurement { // 通用消息头 arnold.common.proto.Header header = 1; // 帧ID string frame_id = 2; foxglove.Vector3 angular_velocity = 3; // 以g为单位的加速度数据 foxglove.Vector3 linear_acceleration = 4; // float temperature = 5; // repeated float angular_velocity_covariance = 6; // repeated float linear_acceleration_covariance = 7; } /robot0/sensor/magnetic_encoder: protobuf message MagneticEncoderMeasurement { // 通用消息头 arnold.common.proto.Header header = 1; // 帧ID string frame_id = 2; // 机械手手指开合距离,范围0-0.103m,0表示完全闭合 double value = 3; } /robot0/vio/eef_pose: protobuf // 三维空间中物体或参考帧的带时间戳位姿消息 message PoseInFrame { // 未使用字段 google.protobuf.Timestamp timestamp = 1; // 帧ID string frame_id = 2; // 三维空间位姿 foxglove.Pose pose = 3; // 线速度 foxglove.Vector3 linear_vel= 4; // 角速度 foxglove.Vector3 angular_vel = 5; // 通用消息头 arnold.common.proto.Header header = 6; } # 数据可视化方法 可视化工具:https://monitor.genrobot.click/#/index # 数据加载方法 参考实现:GitHub - genrobot-ai/das-datakit
提供机构:
maas
创建时间:
2026-01-04
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集是行业最大的开源具身智能数据集,累计超过1万小时数据和100万+片段,具备强大的泛化能力和真实家庭场景采集。它专注于10个关键场景和30个核心技能,提供多模态高质量数据,包括双手机器人任务和高精度传感器信息。第一阶段已上传部分内容,涵盖12项技能,数据格式为mcap,支持广泛的应用和研究。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作