Cross-lingual Visual Pre-training for Multimodal Machine Translation

NIAID Data Ecosystem2026-03-12 收录

下载链接：

https://zenodo.org/records/4646961

下载链接

链接失效反馈

官方服务：

资源简介：

Supplements for the paper entitled "Cross-lingual Visual Pre-training for Multimodal Machine Translation" which is accepted by the EACL'2021 conference. Further instructions on how to use these resources are explained at https://github.com/ImperialNLP/VTLM A tarball that contains a custom train, valid, test split of Conceptual Captions (CC) dataset. The included TSV files havean additional column containing automatic German translations of the original English captions. We only provide samples for which we could download the images and extract meaningful features. This amounts to ~3M out ouf ~3.3M original CC samples. A tarball of the exact object detector checkpoint used for feature extraction. A tarball with pre-extracted Multi30k dataset features.

创建时间：

2021-04-22

5,000+

优质数据集

54 个

任务类型

进入经典数据集