InternVL-Data
收藏InternVL-Data 数据集概述
基本信息
- 语言: 多语言 (multilingual)
- 许可证: CC BY 4.0 (cc-by-4.0)
- 任务类别: 图像到文本 (image-to-text)、问答 (question-answering)
- 数据规模: 10M < n < 100M (10M到100M之间)
数据集简介
InternVL3开放数据集旨在支持多模态大语言模型(MLLMs)的研究与开发,特别是涉及图像、文本和视频理解的任务。数据集由多种来源的数据组成,包括精选的开源数据集、自合成数据集以及从互联网收集的数据。
数据发布计划
- 第一阶段: 发布InternVL2.5和InternVL3的SFT数据。
- 发布时间: 计划在未来2到4周内陆续上传数据,首先发布InternVL2.5的SFT数据,随后发布InternVL3的SFT数据。
数据列表
- InternVL2.5-SFT: 待发布 (TODO)
- InternVL3-SFT: 待发布 (TODO)
引用信息
如果使用此数据集,请考虑引用以下论文: BibTeX @article{zhu2025internvl3, title={InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models}, author={Zhu, Jinguo and Wang, Weiyun and Chen, Zhe and Liu, Zhaoyang and Ye, Shenglong and Gu, Lixin and Duan, Yuchen and Tian, Hao and Su, Weijie and Shao, Jie and others}, journal={arXiv preprint arXiv:2504.10479}, year={2025} } @article{chen2024expanding, title={Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling}, author={Chen, Zhe and Wang, Weiyun and Cao, Yue and Liu, Yangzhou and Gao, Zhangwei and Cui, Erfei and Zhu, Jinguo and Ye, Shenglong and Tian, Hao and Liu, Zhaoyang and others}, journal={arXiv preprint arXiv:2412.05271}, year={2024} } @article{chen2024far, title={How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites}, author={Chen, Zhe and Wang, Weiyun and Tian, Hao and Ye, Shenglong and Gao, Zhangwei and Cui, Erfei and Tong, Wenwen and Hu, Kongzhi and Luo, Jiapeng and Ma, Zheng and others}, journal={arXiv preprint arXiv:2404.16821}, year={2024} } @inproceedings{chen2024internvl, title={Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks}, author={Chen, Zhe and Wu, Jiannan and Wang, Wenhai and Su, Weijie and Chen, Guo and Xing, Sen and Zhong, Muyan and Zhang, Qinglong and Zhu, Xizhou and Lu, Lewei and others}, booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition}, pages={24185--24198}, year={2024} }
相关资源
- GitHub: https://github.com/OpenGVLab/InternVL
- 论文:
- InternVL 1.0: https://huggingface.co/papers/2312.14238
- InternVL 1.5: https://huggingface.co/papers/2404.16821
- InternVL 2.5: https://huggingface.co/papers/2412.05271
- InternVL2.5-MPO: https://huggingface.co/papers/2411.10442
- InternVL3: https://huggingface.co/papers/2504.10479




