InternVid-Full
收藏魔搭社区2025-12-04 更新2024-12-28 收录
下载链接:
https://modelscope.cn/datasets/OpenGVLab/InternVid-Full
下载链接
链接失效反馈官方服务:
资源简介:
# InternVid
## Dataset Description
- **Homepage:** [InternVid](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid)
- **Repository:** [OpenGVLab](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid)
- **Paper:** [2307.06942](https://arxiv.org/pdf/2307.06942.pdf)
- **Point of Contact:** mailto:[InternVideo](gvx-sh@pjlab.org.cn)
## InternVid-Full
We present InternVid-230M, a full set of this dataset, consisting of 230 million video clips, with generated high-quality captions for publicly available web videos.
## Download
The 230M samples are provided in jsonlines file. Columns include the videoID, timestamps, generated caption and their UMT similarity scores.
## How to Use
```
from datasets import load_dataset
dataset = load_dataset("OpenGVLab/InternVid-Full")
```
## Method

## Citation
If you find this work useful for your research, please consider citing InternVid. Your acknowledgement would greatly help us in continuing to contribute resources to the research community.
```
@article{wang2023internvid,
title={InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation},
author={Wang, Yi and He, Yinan and Li, Yizhuo and Li, Kunchang and Yu, Jiashuo and Ma, Xin and Chen, Xinyuan and Wang, Yaohui and Luo, Ping and Liu, Ziwei and Wang, Yali and Wang, Limin and Qiao, Yu},
journal={arXiv preprint arXiv:2307.06942},
year={2023}
}
@article{wang2022internvideo,
title={InternVideo: General Video Foundation Models via Generative and Discriminative Learning},
author={Wang, Yi and Li, Kunchang and Li, Yizhuo and He, Yinan and Huang, Bingkun and Zhao, Zhiyu and Zhang, Hongjie and Xu, Jilan and Liu, Yi and Wang, Zun and Xing, Sen and Chen, Guo and Pan, Junting and Yu, Jiashuo and Wang, Yali and Wang, Limin and Qiao, Yu},
journal={arXiv preprint arXiv:2212.03191},
year={2022}
}
```
# InternVid
## 数据集描述
- **数据集主页**: [InternVid](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid)
- **代码仓库**: [OpenGVLab](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid)
- **关联论文**: [2307.06942](https://arxiv.org/pdf/2307.06942.pdf)
- **联系方式**: mailto:[InternVideo](gvx-sh@pjlab.org.cn)
## InternVid-Full
我们推出了本数据集的完整版本InternVid-230M,该数据集包含2.3亿个视频片段,并为公开可用的网络视频生成了高质量的字幕说明。
## 下载说明
该2.3亿条样本以jsonlines格式文件提供,字段包含videoID、时间戳、生成的字幕以及对应的UMT相似度得分。
## 使用方法
from datasets import load_dataset
dataset = load_dataset("OpenGVLab/InternVid-Full")
## 方法

## 引用说明
若本数据集对你的研究有所帮助,请考虑引用InternVid。你的认可将极大助力我们持续为研究社区贡献相关资源。
@article{wang2023internvid,
title={InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation},
author={Wang, Yi and He, Yinan and Li, Yizhuo and Li, Kunchang and Yu, Jiashuo and Ma, Xin and Chen, Xinyuan and Wang, Yaohui and Luo, Ping and Liu, Ziwei and Wang, Yali and Wang, Limin and Qiao, Yu},
journal={arXiv preprint arXiv:2307.06942},
year={2023}
}
@article{wang2022internvideo,
title={InternVideo: General Video Foundation Models via Generative and Discriminative Learning},
author={Wang, Yi and Li, Kunchang and Li, Yizhuo and He, Yinan and Huang, Bingkun and Zhao, Zhiyu and Zhang, Hongjie and Xu, Jilan and Liu, Yi and Wang, Zun and Xing, Sen and Chen, Guo and Pan, Junting and Yu, Jiashuo and Wang, Yali and Wang, Limin and Qiao, Yu},
journal={arXiv preprint arXiv:2212.03191},
year={2022}
}
提供机构:
maas
创建时间:
2024-12-26



