BrunoM42/robocasa_target_PrepareCoffee

Name: BrunoM42/robocasa_target_PrepareCoffee
Creator: BrunoM42
Published: 2026-03-28 15:44:36
License: 暂无描述

Hugging Face2026-03-28 更新2026-03-29 收录

下载链接：

https://hf-mirror.com/datasets/BrunoM42/robocasa_target_PrepareCoffee

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 task_categories: - robotics tags: - LeRobot configs: - config_name: default data_files: data/*/*.parquet --- This dataset was created using [LeRobot](https://github.com/huggingface/lerobot). ## Dataset Description - **Homepage:** [More Information Needed] - **Paper:** [More Information Needed] - **License:** apache-2.0 ## Dataset Structure [meta/info.json](meta/info.json): ```json { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 514, "total_frames": 279534, "total_tasks": 1, "chunks_size": 1000, "fps": 20, "splits": { "train": "0:514" }, "data_path": "data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet", "video_path": "videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4", "features": { "observation.images.robot0_eye_in_hand": { "dtype": "video", "shape": [ 256, 256, 3 ], "names": [ "height", "width", "channel" ], "video_info": { "video.fps": 20, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "has_audio": false }, "info": { "video.height": 256, "video.width": 256, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "video.fps": 20, "video.channels": 3, "has_audio": false } }, "observation.images.robot0_agentview_left": { "dtype": "video", "shape": [ 256, 256, 3 ], "names": [ "height", "width", "channel" ], "video_info": { "video.fps": 20, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "has_audio": false }, "info": { "video.height": 256, "video.width": 256, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "video.fps": 20, "video.channels": 3, "has_audio": false } }, "observation.images.robot0_agentview_right": { "dtype": "video", "shape": [ 256, 256, 3 ], "names": [ "height", "width", "channel" ], "video_info": { "video.fps": 20, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "has_audio": false }, "info": { "video.height": 256, "video.width": 256, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "video.fps": 20, "video.channels": 3, "has_audio": false } }, "annotation.human.task_description": { "dtype": "int64", "shape": [ 1 ], "fps": 20 }, "annotation.human.task_name": { "dtype": "int64", "shape": [ 1 ], "fps": 20 }, "observation.state": { "dtype": "float64", "shape": [ 16 ], "fps": 20 }, "action": { "dtype": "float64", "shape": [ 12 ], "fps": 20 }, "next.reward": { "dtype": "float32", "shape": [ 1 ], "fps": 20 }, "next.done": { "dtype": "bool", "shape": [ 1 ], "fps": 20 }, "timestamp": { "dtype": "float32", "shape": [ 1 ], "names": null, "fps": 20 }, "frame_index": { "dtype": "int64", "shape": [ 1 ], "names": null, "fps": 20 }, "episode_index": { "dtype": "int64", "shape": [ 1 ], "names": null, "fps": 20 }, "index": { "dtype": "int64", "shape": [ 1 ], "names": null, "fps": 20 }, "task_index": { "dtype": "int64", "shape": [ 1 ], "names": null, "fps": 20 } }, "data_files_size_in_mb": 100, "video_files_size_in_mb": 200 } ``` ## Citation **BibTeX:** ```bibtex [More Information Needed] ```

license: Apache 2.0 任务类别： - 机器人学标签： - LeRobot 配置项： - 配置名称：default 数据文件：data/*/*.parquet --- 本数据集基于[LeRobot](https://github.com/huggingface/lerobot)构建。 ## 数据集说明 - **主页**：暂未提供更多信息 - **论文**：暂未提供更多信息 - **许可证**：Apache 2.0 ## 数据集结构 `meta/info.json`： json { "codebase_version": "v3.0", "robot_type": "PandaOmron", "total_episodes": 514, "total_frames": 279534, "total_tasks": 1, "chunks_size": 1000, "fps": 20, "splits": { "train": "0:514" }, "data_path": "data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet", "video_path": "videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4", "features": { "observation.images.robot0_eye_in_hand": { "dtype": "video", "shape": [256, 256, 3], "names": ["height", "width", "channel"], "video_info": { "video.fps": 20, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "has_audio": false }, "info": { "video.height": 256, "video.width": 256, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "video.fps": 20, "video.channels": 3, "has_audio": false } }, "observation.images.robot0_agentview_left": { "dtype": "video", "shape": [256, 256, 3], "names": ["height", "width", "channel"], "video_info": { "video.fps": 20, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "has_audio": false }, "info": { "video.height": 256, "video.width": 256, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "video.fps": 20, "video.channels": 3, "has_audio": false } }, "observation.images.robot0_agentview_right": { "dtype": "video", "shape": [256, 256, 3], "names": ["height", "width", "channel"], "video_info": { "video.fps": 20, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "has_audio": false }, "info": { "video.height": 256, "video.width": 256, "video.codec": "h264", "video.pix_fmt": "yuv420p", "video.is_depth_map": false, "video.fps": 20, "video.channels": 3, "has_audio": false } }, "annotation.human.task_description": { "dtype": "int64", "shape": [1], "fps": 20 }, "annotation.human.task_name": { "dtype": "int64", "shape": [1], "fps": 20 }, "observation.state": { "dtype": "float64", "shape": [16], "fps": 20 }, "action": { "dtype": "float64", "shape": [12], "fps": 20 }, "next.reward": { "dtype": "float32", "shape": [1], "fps": 20 }, "next.done": { "dtype": "bool", "shape": [1], "fps": 20 }, "timestamp": { "dtype": "float32", "shape": [1], "names": null, "fps": 20 }, "frame_index": { "dtype": "int64", "shape": [1], "names": null, "fps": 20 }, "episode_index": { "dtype": "int64", "shape": [1], "names": null, "fps": 20 }, "index": { "dtype": "int64", "shape": [1], "names": null, "fps": 20 }, "task_index": { "dtype": "int64", "shape": [1], "names": null, "fps": 20 } }, "data_files_size_in_mb": 100, "video_files_size_in_mb": 200 } ## 引用 **BibTeX格式：** bibtex [暂未提供更多信息]

提供机构：

BrunoM42

搜集汇总

数据集介绍

构建方式

在机器人操作学习领域，robocasa_target_PrepareCoffee数据集依托LeRobot框架构建，专注于咖啡制备这一具体任务。该数据集通过PandaOmron机器人平台采集了514个完整操作片段，总计包含279,534帧数据，并以20帧每秒的速率记录。数据以分块形式存储于Parquet文件中，每块约1000帧，确保了高效的数据管理与访问。构建过程中整合了多视角视觉信息与机器人状态数据，为后续算法训练提供了坚实的多模态基础。

使用方法

研究人员可通过LeRobot库或兼容的机器学习框架加载此数据集。数据以分块Parquet格式组织，支持按片段或帧级进行流式读取。典型应用流程包括加载观测图像、机器人状态及对应动作序列，进而训练端到端的策略网络或进行行为克隆。数据集已预分为训练集，涵盖全部514个片段，可直接用于模型训练与验证。使用时应遵循Apache 2.0许可协议，并注意数据中蕴含的时序依赖关系，合理设计网络结构以利用其多模态与高帧率的特性。

背景与挑战

背景概述

在机器人学习领域，家庭环境下的复杂任务执行一直是研究的前沿课题。robocasa_target_PrepareCoffee数据集应运而生，旨在为机器人咖啡制备这一具体任务提供高质量的多模态演示数据。该数据集由LeRobot项目创建，采用PandaOmron机器人平台，包含514个完整任务片段，总计超过27万帧数据，涵盖了手眼相机与全局视角的视觉观测、机器人状态及动作指令。其核心研究问题聚焦于如何通过大规模真实世界演示数据，推动机器人模仿学习与强化学习算法在精细化操作任务上的性能突破，为家庭服务机器人的实际应用奠定数据基础。

当前挑战

该数据集致力于解决机器人模仿学习中复杂长时序任务学习的挑战，特别是咖啡制备这类涉及多步骤、精细操作与物体交互的日常活动。其构建过程面临多重困难：首先，真实机器人数据采集成本高昂，需确保操作的安全性与一致性；其次，多视角视频数据与高维状态动作数据的精确同步与对齐存在技术难度；再者，任务标注的语义一致性维护，以及大规模数据的高效存储与读取，均对数据工程提出了严峻考验。这些挑战共同指向了构建高质量、可扩展机器人演示数据集的核心瓶颈。

常用场景

经典使用场景

在机器人学习领域，robocasa_target_PrepareCoffee数据集以其专注于单一任务——制作咖啡的精细操作而著称。该数据集通过PandaOmron机器人采集了514个完整操作序列，包含多视角视频与状态动作数据，为模仿学习与强化学习算法提供了丰富的训练资源。研究者能够利用这些高保真度的演示数据，训练机器人执行从抓取咖啡杯到操作咖啡机的复杂流程，从而在模拟环境中验证算法的泛化能力与鲁棒性。

解决学术问题

该数据集针对机器人操作中的长时程任务规划与精细动作控制等核心学术问题提供了解决方案。通过提供结构化的人类演示数据，它助力于解决模仿学习中分布偏移与样本效率低下的挑战，并为强化学习中的稀疏奖励问题提供了密集的监督信号。其意义在于推动了具身智能在家庭环境中的实际应用，为机器人执行日常任务奠定了数据基础，促进了跨模态感知与决策一体化研究的发展。

实际应用

robocasa_target_PrepareCoffee数据集的实际应用场景紧密围绕家庭服务机器人的自动化操作展开。在智能家居系统中，机器人可依据数据集学习的策略自主完成冲泡咖啡等日常任务，提升生活便利性。此外，该数据集亦可用于工业场景中的自动化流程培训，例如在餐饮服务业中辅助机器人进行标准化饮品制作，降低人力成本并保证操作一致性，展现了从实验室研究到现实部署的平滑过渡潜力。

数据集最近研究