Jazzcharles/epic_kitchen_videomae_L14_feature_fps8
Source: Hugging Face. Updated 2024-05-07; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/Jazzcharles/epic_kitchen_videomae_L14_feature_fps8
Resource description:
---
license: apache-2.0
task_categories:
- video-classification
- video-retrieval
language:
- en
size_categories:
- 100M<n<1B
---
## 📙 Overview
EPIC-KITCHENS-100 video features extracted with VideoMAE_L14 at 8 fps. They are used to evaluate the video-text retrieval ability of [EgoInstructor](https://arxiv.org/pdf/2401.00789).
The dataset contains 700 files. Each file (e.g. `P01_01.pth.tar`) stores a T×D feature matrix, where T is the length of the video and D is 768.
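
As a minimal sketch of reading one feature file: the snippet below assumes the files were written with `torch.save` and hold either a tensor directly or a dict wrapping a single tensor. Neither detail is confirmed by this card, and `load_features` is a hypothetical helper, not part of the EgoInstructor code. It only checks that the result has the T×D shape described above, with D = 768.

```python
import torch

EXPECTED_DIM = 768  # D, as stated in the dataset card


def load_features(path):
    """Load one feature file and return a 2-D (T, D) tensor.

    Assumption (not confirmed by the dataset card): the file was written
    with torch.save and contains either a tensor or a dict holding exactly
    one tensor. Adjust this function if the real layout differs.
    """
    obj = torch.load(path, map_location="cpu")

    # Unwrap a dict that holds exactly one tensor value.
    if isinstance(obj, dict):
        tensors = [v for v in obj.values() if torch.is_tensor(v)]
        if len(tensors) != 1:
            raise ValueError(f"expected exactly one tensor in dict, found {len(tensors)}")
        obj = tensors[0]

    if not torch.is_tensor(obj):
        raise TypeError(f"unsupported file content: {type(obj).__name__}")

    # Check the T x D layout described in the card.
    if obj.ndim != 2 or obj.shape[1] != EXPECTED_DIM:
        raise ValueError(f"expected shape (T, {EXPECTED_DIM}), got {tuple(obj.shape)}")
    return obj


# Example call, assuming P01_01.pth.tar has already been downloaded locally:
# feats = load_features("P01_01.pth.tar")
# print(feats.shape)  # (T, 768), where T depends on the video
```

If a file fails these checks, inspecting the loaded object directly (for example with `type(obj)` and, for dicts, `obj.keys()`) is a reasonable first step before consulting the EgoInstructor repository.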
## 🏋️ How-To-Use
Please refer to the [EgoInstructor](https://github.com/Jazzcharles/Egoinstructor/) code for details.
## 🎓 Citation
```
@article{xu2024retrieval,
title={Retrieval-augmented egocentric video captioning},
author={Xu, Jilan and Huang, Yifei and Hou, Junlin and Chen, Guo and Zhang, Yuejie and Feng, Rui and Xie, Weidi},
journal={arXiv preprint arXiv:2401.00789},
year={2024}
}
```
Provider:
Jazzcharles
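Summary of original information
Dataset overview
Basic information
- License: Apache-2.0
- Task categories:
  - Video classification
  - Video retrieval
- Language: English
- Dataset size: 100M<n<1B
Dataset contents
- Description: EPIC-KITCHENS-100 video features extracted by the VideoMAE_L14 model at 8 fps, used to evaluate the video-text retrieval ability of EgoInstructor.
- Number of files: 700
- File format: each file (e.g. P01_01.pth.tar) contains a T×D feature matrix, where T is the length of the video and D is 768.
Usage guide
- Reference code: see the EgoInstructor project code for detailed usage.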



