TrainThemAI/POV-Egocentric-Video-Robotics-FHD-Samples

Name: TrainThemAI/POV-Egocentric-Video-Robotics-FHD-Samples
Creator: TrainThemAI
Published: 2026-04-10 04:48:10
License: 暂无描述

Hugging Face2026-04-10 更新2026-04-12 收录

下载链接：

https://hf-mirror.com/datasets/TrainThemAI/POV-Egocentric-Video-Robotics-FHD-Samples

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: mit task_categories: - video-classification - robotics tags: - egocentric - pov - robotics - human-demonstration - vla - manipulation - activities-of-daily-living pretty_name: TrainThem AI - Egocentric POV FHD Samples size_categories: - n<1K --- # TrainThem AI — Egocentric POV FHD Samples ## Dataset Summary This dataset contains high-fidelity, egocentric (first-person POV) video samples designed specifically for training generalist robotics policies (e.g., π0, π1, VLA models) and humanoid manipulation systems. The data is collected by [TrainThem AI](https://trainthemai.com), a specialized agency providing diverse, rights-cleared human demonstration data at scale — coordinated via Slack, filmed with head-mounted cameras, and QA'd for robotics pre-training. --- ## Why Egocentric Video for Robot Training? Robots trained to manipulate objects need to understand the world from the perspective of an embodied agent — not a third-person camera or a fixed overhead view. Head-mounted egocentric video provides: - **The correct visual frame of reference** — the same POV a robot with a head-mounted camera experiences - **Natural hand-eye coordination signal** — hands in frame throughout, matching the robot's own kinematic structure - **Real-world distribution** — lighting variation, clutter, diverse objects and surfaces that sim-to-real transfer cannot replicate - **Dense action supervision** — continuous manipulation sequences ideal for behavioral cloning and VLA fine-tuning This is the data modality used by leading robotics labs for pre-training foundation models (e.g., RT-2, π0, RoboFlamingo, GR00T). --- ## Video Content Three continuous activity episodes demonstrating Activities of Daily Living (ADL) and fine-motor manipulation: | File | Activity | Focus | |---|---|---| | `tidying_the_bedroom.mp4` | Bedroom tidying | Object manipulation, spatial reasoning, organization | | `organizing_bathroom.mp4` | Bathroom organization | Fine-motor sorting, varied materials, object placement | | `washing_dishes.mp4` | Dish washing | Wet environment, bimanual coordination, tool use | --- ## Technical Specifications | Property | Value | |---|---| | Resolution | 1920×1080 (FHD) | | Frame rate | 30 fps | | Perspective | Head-mounted egocentric (true POV) | | Environment | Real-world domestic, non-simulated | | Hands in frame | >90% of duration | | Action density | No idle sequences >3 seconds | | Rights | Fully cleared for AI training use | | Total sample size | ~5.4 GB | --- ## QA Standards All data from TrainThem AI meets the following criteria before delivery: - True head-mounted POV (not handheld or chest-mounted) - Hands consistently in frame during manipulation - Action-dense — dead time removed - Real domestic environments (not staged or simulated) - Rights cleared — all collectors sign data licensing agreements --- ## Full Dataset & Bespoke Collection These three clips are a limited evaluation sample. TrainThem AI currently operates at **150+ hours per day** of fully QA'd egocentric data capacity, covering: - **100+ unique real-world environments** - **80+ task subcategories** (kitchen, workshop, office, outdoor, etc.) - Optional add-ons: 3D hand tracking, depth data (RGB-D), language annotations, multi-camera rigs We offer both **off-the-shelf bulk data** and **bespoke collection** designed to your exact task taxonomy and annotation schema. --- ## Contact To discuss bulk licensing, bespoke collection, or access to our full inventory: - **Website:** [trainthemai.com](https://trainthemai.com) - **Email:** diego.pousa@trainthemai.com --- ## Citation If you use this data in your research or products, please cite:

提供机构：

TrainThemAI

5,000+

优质数据集

54 个

任务类型

进入经典数据集