InternVL2.5-MPO
Source: arXiv (indexed 2025-09-30)
Download link:
https://github.com/OpenGVLab/InternVL
Resource description:
This entry uses InternVL2.5-MPO, a large vision-language model, as part of a training framework for document image machine translation. Training uses DeepSpeed's zero3-offload configuration on 8 GPUs, with configuration-specific batch sizes and learning rates. The task aims to achieve end-to-end document image machine translation.
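The training setup mentioned above (DeepSpeed zero3-offload on 8 GPUs) might be expressed as a configuration roughly like the following. This is only a sketch under stated assumptions: the function name `build_zero3_offload_config`, the bf16 setting, and the batch-size values are hypothetical, since the entry does not give the actual batch sizes or learning rates. The configuration keys themselves are standard DeepSpeed JSON config fields.

```python
import json


def build_zero3_offload_config(micro_batch_per_gpu, grad_accum_steps, num_gpus=8):
    """Build a DeepSpeed-style config dict for ZeRO stage 3 with CPU offload.

    All numeric values are placeholders (assumptions); the listing does not
    state the real batch sizes or learning rates.
    """
    return {
        # DeepSpeed expects train_batch_size to equal
        # micro batch per GPU * gradient accumulation steps * number of GPUs.
        "train_micro_batch_size_per_gpu": micro_batch_per_gpu,
        "gradient_accumulation_steps": grad_accum_steps,
        "train_batch_size": micro_batch_per_gpu * grad_accum_steps * num_gpus,
        # Mixed precision is an assumption, not stated in the listing.
        "bf16": {"enabled": True},
        "zero_optimization": {
            # Stage 3 partitions parameters, gradients, and optimizer state.
            "stage": 3,
            # "Offload" moves optimizer state and parameters to CPU memory.
            "offload_optimizer": {"device": "cpu", "pin_memory": True},
            "offload_param": {"device": "cpu", "pin_memory": True},
        },
    }


if __name__ == "__main__":
    # Hypothetical values: 1 sample per GPU, 4 accumulation steps, 8 GPUs.
    print(json.dumps(build_zero3_offload_config(1, 4), indent=2))
```

Writing the resulting dictionary to a JSON file and passing it to the training launcher is the usual way such a configuration is consumed; the actual files used for this entry, if published, would be in the repository linked above.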
Provided by:
Huawei Translation Service Center



