five

MicroAGI00

收藏
魔搭社区2025-11-27 更新2025-11-03 收录
下载链接:
https://modelscope.cn/datasets/MicroAGI-Labs/MicroAGI00
下载链接
链接失效反馈
官方服务:
资源简介:
--- # MicroAGI00: MicroAGI Egocentric Dataset (2025) > License: MicroAGI00 Open Use, No-Resale v1.0 (see `LICENSE`). > No resale: You may not sell or paywall this dataset or derivative data. Trained models/outputs may be released under any terms. ## Overview MicroAGI00 is a large-scale egocentric RGB+D dataset of human manipulation in https://behavior.stanford.edu/challenge/index.html tasks. ## Quick facts * Modality: synchronized RGB + 16‑bit depth + IMU + annotations * Resolution & rate (RGB): 1920×1080 @ 30 FPS (in MCAP) * Depth: 16‑bit, losslessly compressed inside MCAP * Scale: ≈1,000,000 synchronized RGB frames and ≈1,000,000 depth frames (≈1M frame pairs) * Container: `.mcap` (all signals + annotations) * Previews: For as sample for only some bags `.mp4` per sequence (annotated RGB; visualized native depth) * Annotations: Only in %5 of the dataset, hand landmarks and short action text ## What’s included per sequence * One large **MCAP** file containing: * RGB frames (1080p/30 fps) * 16‑bit depth stream (lossless compression) * IMU data (as available) * For Some Data the Embedded annotations (hands, action text) **MP4** preview videos: * Annotated RGB (for quick review) * Visualized native depth map (for quick review) > Note: MP4 previews may be lower quality than MCAP due to compression and post‑processing. Research use should read from MCAP. ## Annotations Annotations are generated by our in‑house. ### Hand annotations 21 Joints (not all shown below as it would be too long) (per frame) — JSON schema example ``` { "frame_number": 9, "timestamp_seconds": 0.3, "resolution": { "width": 1920, "height": 1080 }, "hands": [ { "hand_index": 0, "landmarks": [ { "id": 0, "name": "WRIST", "x": 0.7124036550521851, "y": 0.7347621917724609, "z": -1.444301744868426e-07, "visibility": 0.0 }, ], "hand": "Left", "confidence": 0.9268525838851929 }, { "hand_index": 1, "landmarks": [ { "id": 0, "name": "WRIST", "x": 0.4461262822151184, "y": 0.35183972120285034, "z": -1.2342320587777067e-07, "visibility": 0.0 }, "hand": "Right", "confidence": 0.908446729183197 } ], "frame_idx": 9, "exact_frame_timestamp": 1758122341583104000, "exact_frame_timestamp_sec": 1758122341.583104 } ``` ### Text (action) annotations (per frame/window) — JSON schema example ``` { "schema_version": "v1.0", "action_text": "Right hand, holding a knife, is chopping cooked meat held by the left hand on the red cutting board.", "confidence": 1.0, "source": { "model": "MicroAGI, MAGI01" }, "exact_frame_timestamp": 1758122341583104000, "exact_frame_timestamp_sec": 1758122341.583104 } ``` ## Data access and structure * Each top-level sample folder contains: One folder of strong heavy mcap dump, one folder of annotated mcap dump, one folder of mp4 previews * All authoritative signals and annotations are inside the MCAP. Use the MP4s for quick visual QA only. ## Getting started * Inspect an MCAP: `mcap info your_sequence.mcap` * Extract messages: `mcap cat --topics <topic> your_sequence.mcap > out.bin` * Python readers: `pip install mcap` (see the MCAP Python docs) or any MCAP-compatible tooling. Typical topics include RGB, depth, IMU, and annotation channels. ## Intended uses * Policy and skill learning (robotics/VLA) * Action detection and segmentation * Hand/pose estimation and grasp analysis * Depth-based reconstruction, SLAM, scene understanding * World-model pre-post training ## Services and custom data MicroAGI provides on-demand: * Real‑to‑Sim pipelines * ML‑enhanced 3D point clouds and SLAM reconstructions * New data capture via our network of skilled tradespeople and factory workers (often below typical market cost) * Enablement for your workforce to wear our device and run through our processing pipeline Typical lead times: under two weeks (up to four weeks for large jobs). ## How to order more Email `data@micro-agi.com` with: * Task description * Desired hours or frame counts * Proposed price We will reply within one business day with lead time and final pricing. Questions: `info@micro-agi.com` ## License This dataset is released under the MicroAGI00 Open Use, No‑Resale License v1.0 (custom). See [`LICENSE`](./LICENSE). Redistribution must be free‑of‑charge under the same license. Required credit: "This work uses the MicroAGI00 dataset (MicroAGI, 2025)." ## Attribution reminder Public uses of the Dataset or Derivative Data must include the credit line above in a reasonable location for the medium (papers, repos, product docs, dataset pages, demo descriptions). Attribution is appreciated but not required for Trained Models or Outputs.

# MicroAGI00:MicroAGI 第一视角数据集(2025) > 授权协议:MicroAGI00 非转售开放使用协议v1.0(详见`LICENSE`文件)。 > 非转售条款:您不得将本数据集或其衍生数据进行售卖或设置付费墙。基于本数据集训练得到的模型/输出可采用任意协议发布。 ## 概述 MicroAGI00 是一个大规模人类操作第一视角RGB+D数据集,数据采集自[斯坦福行为挑战平台](https://behavior.stanford.edu/challenge/index.html)的任务场景。 ## 核心参数 * 模态:同步RGB图像 + 16位深度图 + 惯性测量单元(Inertial Measurement Unit,IMU)数据 + 标注信息 * RGB分辨率与帧率:1920×1080,30帧/秒(存储格式为MCAP) * 深度图:16位精度,在MCAP文件内采用无损压缩 * 数据规模:约100万对同步RGB与深度图帧(总计约100万RGB帧、100万深度图帧) * 存储容器:`.mcap`格式(包含所有信号与标注信息) * 预览文件:仅部分序列提供`.mp4`格式预览样本(含标注RGB帧与可视化原生深度图) * 标注覆盖:仅5%的数据集包含手部关键点与简短动作文本标注 ## 单序列包含内容 每个序列包含以下内容: * 一个大型**MCAP**文件,内含: * RGB帧(1080p分辨率,30帧/秒) * 16位深度数据流(采用无损压缩) * 惯性测量单元数据(视采集情况提供) * 部分数据内置的标注信息(手部关键点、动作文本) **MP4**格式预览视频: * 带标注的RGB帧(用于快速预览) * 原生深度图可视化结果(用于快速预览) > 注意:由于压缩与后处理操作,MP4预览文件的画质可能低于MCAP原始文件。学术研究应优先读取MCAP格式数据。 ## 标注说明 本数据集的标注均由团队自研流程生成。 ### 手部标注(21个关节点,因篇幅限制未完全展示下方示例)——每帧标注的JSON schema示例 { "frame_number": 9, "timestamp_seconds": 0.3, "resolution": { "width": 1920, "height": 1080 }, "hands": [ { "hand_index": 0, "landmarks": [ { "id": 0, "name": "WRIST", "x": 0.7124036550521851, "y": 0.7347621917724609, "z": -1.444301744868426e-07, "visibility": 0.0 }, ], "hand": "Left", "confidence": 0.9268525838851929 }, { "hand_index": 1, "landmarks": [ { "id": 0, "name": "WRIST", "x": 0.4461262822151184, "y": 0.35183972120285034, "z": -1.2342320587777067e-07, "visibility": 0.0 }, "hand": "Right", "confidence": 0.908446729183197 } ], "frame_idx": 9, "exact_frame_timestamp": 1758122341583104000, "exact_frame_timestamp_sec": 1758122341.583104 } ### 文本(动作)标注(每帧/每窗口)——JSON schema示例 { "schema_version": "v1.0", "action_text": "Right hand, holding a knife, is chopping cooked meat held by the left hand on the red cutting board.", "confidence": 1.0, "source": { "model": "MicroAGI, MAGI01" }, "exact_frame_timestamp": 1758122341583104000, "exact_frame_timestamp_sec": 1758122341.583104 } ## 数据访问与组织结构 * 每个顶级样本文件夹包含:原始MCAP数据文件夹、带标注MCAP数据文件夹、MP4预览文件文件夹 * 所有权威信号与标注信息均存储于MCAP文件中。MP4文件仅可用于快速视觉质量检查(QA)。 ## 快速上手 * 查看MCAP文件信息:`mcap info your_sequence.mcap` * 提取指定话题消息:`mcap cat --topics <topic> your_sequence.mcap > out.bin` * Python读取工具:执行`pip install mcap`(详见MCAP官方Python文档)或使用任意兼容MCAP格式的工具。常见话题包括RGB、深度图、IMU与标注通道。 ## 预期应用场景 * 策略与技能学习(机器人学/视觉语言动作模型(Visual Language Action, VLA)) * 动作检测与分割 * 手部/姿态估计与抓取分析 * 基于深度图的三维重建、同步定位与地图构建(Simultaneous Localization and Mapping, SLAM)、场景理解 * 世界模型预训练与微调 ## 定制化服务与专属数据 MicroAGI 可按需提供以下服务: * 真实场景到仿真环境的转换管线 * 机器学习增强的三维点云与SLAM重建结果 * 通过我们的熟练技工与工厂工人网络采集新数据(价格通常低于市场均价) * 为您的团队提供设备佩戴与数据处理流水线部署支持 常规交付周期:两周以内(大型项目最长不超过四周)。 ## 定制数据订购方式 请发送邮件至`data@micro-agi.com`,并提供以下信息: * 任务描述 * 所需采集时长或帧数量 * 预估报价 我们将在一个工作日内回复交付周期与最终定价。 咨询问题请联系:`info@micro-agi.com` ## 授权协议 本数据集采用MicroAGI00 非转售开放使用协议v1.0(定制协议),详见[`LICENSE`](./LICENSE)文件。再分发必须以相同协议免费进行。强制引用要求:"This work uses the MicroAGI00 dataset (MicroAGI, 2025)." ## 引用提示 若公开使用本数据集或其衍生数据,必须在对应媒介(论文、代码仓库、产品文档、数据集页面、演示说明等)的合理位置添加上述引用语句。对于基于本数据集训练得到的模型或输出,引用为推荐项而非强制要求。
提供机构:
maas
创建时间:
2025-10-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作