five

KlingTeam/Context-as-Memory-Dataset

收藏
Hugging Face2025-10-09 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/KlingTeam/Context-as-Memory-Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
<div align="center"> <h1>Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval</h1> <h1>SIGGRAPH Asia 2025</h1> <p> <a href="https://context-as-memory.github.io/">[Project page]</a> <a href="https://arxiv.org/pdf/2506.03141">[ArXiv]</a> <a href="https://huggingface.co/datasets/KwaiVGI/Context-as-Memory-Dataset">[Dataset]</a> </p> </div> # File Structure To prepare the dataset for use, merge the parts into a single zip file using the following command: ```bash cat Context-as-Memory-Dataset_* > Context-as-Memory-Dataset.zip ``` After extracting `Context-as-Memory-Dataset.zip`, the dataset will be organized as follows: ``` Context-as-Memory-Dataset ├── frames │   ├── AncientTempleEnv_0 │   │   ├── 0000.png │   │   ├── 0001.png │   │   ├── 0002.png │   │   └── ... │   ├── AncientTempleEnv_1 │   │   ├── 0000.png │   │   ├── 0001.png │   │   ├── 0002.png │   │   └── ... │   └── ... │   ├── jsons │   ├── AncientTempleEnv_0.json │   ├── AncientTempleEnv_1.json │   └── ... │ ├── overlap_labels │   ├── AncientTempleEnv_0 │   │   ├── 0.json │   │   ├── 1.json │   │   ├── 2.json │   │   └── ... │   ├── AncientTempleEnv_1 │   │   ├── 0.json │   │   ├── 1.json │   │   ├── 2.json │   │   └── ... │   └── ... │   └── captions.txt ``` # Explanation of Dataset Parts - **`frames/`**: 100 subdirectories, each containing 7,601 video frame images. - **`jsons/`**: 100 JSON files, each storing the camera pose (position + rotation) of every frame in the corresponding long video. - **`overlap_labels/`**: 100 subdirectories, each containing 7,601 JSON files, where each file records the indices of overlapping frames corresponding to that frame. - **`captions.txt`**: Captions annotated for a segment of a long video, from a given starting frame to an ending frame. - We also provide a simple code file, `tools.py`, which can convert (x, y, z, yaw, pitch) into RT, and can also select a specific frame as the reference frame to align the RT of other frames to its coordinate system.
提供机构:
KlingTeam
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作