five

Physis-AI/wm-eval-gt-ewmbench

收藏
Hugging Face2026-04-02 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Physis-AI/wm-eval-gt-ewmbench
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-sa-4.0 task_categories: - video-classification - robotics tags: - world-model - evaluation - embodied-ai - ground-truth pretty_name: "WM-Eval GT: EWMBench Ground Truth" size_categories: - 1K<n<10K --- # WM-Eval GT: EWMBench Ground Truth Ground truth data for [EWMBench](https://github.com/AgibotTech/EWMBench) evaluation, extracted and restructured for direct use by the [wm-evaluation-harness](https://github.com/Physis-AI/wm-evaluation-harness) framework. ## Source Extracted from [`agibot-world/EWMBench`](https://huggingface.co/datasets/agibot-world/EWMBench) (`gt_dataset.tar`). Original dataset by [AgibotTech](https://github.com/AgibotTech). ## Dataset Structure ``` {task_id}/ ├── {episode_id}/ │ ├── prompt/ │ │ ├── init_frame.png # Initial frame for generation │ │ └── prompt.txt # Task description │ └── video/ │ ├── frame_00000.jpg # Ground truth frame sequence │ ├── frame_00001.jpg │ └── ... ``` ## Statistics | Item | Count | |------|-------| | Tasks | 7 (367, 392, 497, 511, 543, 558, 574) | | Episodes per task | 3 | | Total episodes | 21 | | Total files | 3,959 | | Size | ~129 MB | ## Usage with wm-evaluation-harness This dataset is auto-downloaded by the framework when referenced as a HuggingFace dataset repo ID: ```yaml # configs/ewmbench_hf.yaml evaluation: gt_dir: Physis-AI/wm-eval-gt-ewmbench # auto-download benchmarks: ewmbench: metrics: [psnr, ssim] ``` ```bash wm-eval evaluate --config configs/ewmbench_hf.yaml --videos <your_pred_dir> ``` Or used programmatically: ```python from wm_eval.evaluation.models.registry import ModelRegistry registry = ModelRegistry() gt_path = registry.resolve("Physis-AI/wm-eval-gt-ewmbench", repo_type="dataset") ``` ## License This dataset follows the original [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license from AgibotTech/EWMBench. ## Citation ```bibtex @article{ewmbench2025, title={EWMBench: Evaluating Scene Understanding and Generation Quality for Embodied World Models}, author={AgibotTech}, year={2025}, eprint={2505.09694}, archivePrefix={arXiv} } ```
提供机构:
Physis-AI
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作