Y-J-Ju/MIRe_ViD2R

Name: Y-J-Ju/MIRe_ViD2R
Creator: Y-J-Ju
Published: 2025-02-21 07:44:09
License: 暂无描述

Hugging Face2025-02-21 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/Y-J-Ju/MIRe_ViD2R

下载链接

链接失效反馈

官方服务：

资源简介：

MIRe多模态检索预训练数据集包含了图像和文本查询-段落对，用于训练不需要在对齐阶段融合文本特征的多模态检索系统。数据集由简洁的问答对转换成扩展段落构成，更好地模拟了现实世界中的检索任务。数据集包含了来自ST-VQA、TextVQA、LLaVAR、Instruct4V和LLaVA-1.5等多个来源的样本，保证了多样性，并排除了WiT数据。

The MIRe Pre-training Dataset for Multimodal Query Retrieval consists of image-text query-passage pairs for training multimodal retrieval systems that do not fuse text features during the alignment stage. The dataset is composed of concise question-answer pairs converted into extended passages, better mimicking real-world retrieval tasks. It includes samples from sources like ST-VQA, TextVQA, LLaVAR, Instruct4V, and LLaVA-1.5, ensuring diversity and excluding WiT data.

提供机构：

Y-J-Ju

5,000+

优质数据集

54 个

任务类型

进入经典数据集