five

KlingTeam/HM-World

收藏
Hugging Face2026-04-22 更新2026-05-10 收录
下载链接:
https://hf-mirror.com/datasets/KlingTeam/HM-World
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: HM-World license: apache-2.0 language: - en tags: - video - computer-vision - world-model - multimodal --- <h1 align="center">Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models</h1> <div align="center"> <a href="https://kj-chen666.github.io/Hybrid-Memory-in-Video-World-Models/"><img src="https://img.shields.io/badge/Project-Page-orange.svg?logo=googlehome" alt="Project Page"></a> <a href="https://github.com/H-EmbodVis/HyDRA"><img src="https://img.shields.io/badge/GitHub-Repository-black?logo=github" alt="GitHub"></a> <a href="https://arxiv.org/pdf/2603.25716"><img src="https://img.shields.io/badge/arXiv-Paper-b31b1b?logo=Arxiv" alt="arXiv"></a> </div> ## HM-World `HM-World` is a dataset for hybrid memory in dynamic video world models. It provides video sequences, foreground masks, camera pose annotations, character pose annotations, event timestamps, and text captions for each sample. ## Usage Merge the split archives and extract the dataset with: ```bash cat HM-World_* | tar -xzvf - ``` ## Dataset Structure ```text HM-World/ ├── sample1/ │ ├── cond.mp4 │ ├── tgt.mp4 │ ├── cond_mask.mp4 │ ├── tgt_mask.mp4 │ ├── camera.json │ ├── character.json │ └── check.json ├── sample2/ │ └── ... ├── ... └── caption.txt ``` ## File Description - `cond.mp4`: condition video. - `tgt.mp4`: target video. - `cond_mask.mp4`: foreground mask video for the condition video. - `tgt_mask.mp4`: foreground mask video for the target video. - `camera.json`: camera pose information. - `character.json`: character pose information. - `check.json`: timestamps that record when the subjects enters or leaves the frame. - `caption.txt`: captions for all samples in the dataset.
提供机构:
KlingTeam
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作