InternVid-Full

Name: InternVid-Full
Creator: maas
Published: 2025-12-04 16:19:35
License: 暂无描述

魔搭社区2025-12-04 更新2024-12-28 收录

下载链接：

https://modelscope.cn/datasets/OpenGVLab/InternVid-Full

下载链接

链接失效反馈

官方服务：

资源简介：

# InternVid ## Dataset Description - **Homepage:** [InternVid](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid) - **Repository:** [OpenGVLab](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid) - **Paper:** [2307.06942](https://arxiv.org/pdf/2307.06942.pdf) - **Point of Contact:** mailto:[InternVideo](gvx-sh@pjlab.org.cn) ## InternVid-Full We present InternVid-230M, a full set of this dataset, consisting of 230 million video clips, with generated high-quality captions for publicly available web videos. ## Download The 230M samples are provided in jsonlines file. Columns include the videoID, timestamps, generated caption and their UMT similarity scores. ## How to Use ``` from datasets import load_dataset dataset = load_dataset("OpenGVLab/InternVid-Full") ``` ## Method ![Caption Method](caption_fig.jpg) ## Citation If you find this work useful for your research, please consider citing InternVid. Your acknowledgement would greatly help us in continuing to contribute resources to the research community. ``` @article{wang2023internvid, title={InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation}, author={Wang, Yi and He, Yinan and Li, Yizhuo and Li, Kunchang and Yu, Jiashuo and Ma, Xin and Chen, Xinyuan and Wang, Yaohui and Luo, Ping and Liu, Ziwei and Wang, Yali and Wang, Limin and Qiao, Yu}, journal={arXiv preprint arXiv:2307.06942}, year={2023} } @article{wang2022internvideo, title={InternVideo: General Video Foundation Models via Generative and Discriminative Learning}, author={Wang, Yi and Li, Kunchang and Li, Yizhuo and He, Yinan and Huang, Bingkun and Zhao, Zhiyu and Zhang, Hongjie and Xu, Jilan and Liu, Yi and Wang, Zun and Xing, Sen and Chen, Guo and Pan, Junting and Yu, Jiashuo and Wang, Yali and Wang, Limin and Qiao, Yu}, journal={arXiv preprint arXiv:2212.03191}, year={2022} } ```

# InternVid ## 数据集描述 - **数据集主页**: [InternVid](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid) - **代码仓库**: [OpenGVLab](https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVid) - **关联论文**: [2307.06942](https://arxiv.org/pdf/2307.06942.pdf) - **联系方式**: mailto:[InternVideo](gvx-sh@pjlab.org.cn) ## InternVid-Full 我们推出了本数据集的完整版本InternVid-230M，该数据集包含2.3亿个视频片段，并为公开可用的网络视频生成了高质量的字幕说明。 ## 下载说明该2.3亿条样本以jsonlines格式文件提供，字段包含videoID、时间戳、生成的字幕以及对应的UMT相似度得分。 ## 使用方法 from datasets import load_dataset dataset = load_dataset("OpenGVLab/InternVid-Full") ## 方法 ![Caption Method](caption_fig.jpg) ## 引用说明若本数据集对你的研究有所帮助，请考虑引用InternVid。你的认可将极大助力我们持续为研究社区贡献相关资源。 @article{wang2023internvid, title={InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation}, author={Wang, Yi and He, Yinan and Li, Yizhuo and Li, Kunchang and Yu, Jiashuo and Ma, Xin and Chen, Xinyuan and Wang, Yaohui and Luo, Ping and Liu, Ziwei and Wang, Yali and Wang, Limin and Qiao, Yu}, journal={arXiv preprint arXiv:2307.06942}, year={2023} } @article{wang2022internvideo, title={InternVideo: General Video Foundation Models via Generative and Discriminative Learning}, author={Wang, Yi and Li, Kunchang and Li, Yizhuo and He, Yinan and Huang, Bingkun and Zhao, Zhiyu and Zhang, Hongjie and Xu, Jilan and Liu, Yi and Wang, Zun and Xing, Sen and Chen, Guo and Pan, Junting and Yu, Jiashuo and Wang, Yali and Wang, Limin and Qiao, Yu}, journal={arXiv preprint arXiv:2212.03191}, year={2022} }

提供机构：

maas

创建时间：

2024-12-26

5,000+

优质数据集

54 个

任务类型

进入经典数据集