OGC_ibm-research_REAL-MM-RAG
收藏魔搭社区2025-12-05 更新2025-08-30 收录
下载链接:
https://modelscope.cn/datasets/racineai/OGC_ibm-research_REAL-MM-RAG
下载链接
链接失效反馈官方服务:
资源简介:
# VDR_ibm-research_REAL-MM-RAG - Overview
## Dataset Summary
**VDR_ibm-research_REAL-MM-RAG is a multimodal dataset that combines text and image data, and support tasks such as DSE retrieval (RAG).**
## Dataset Creation
This dataset is a merge and shuffle of the following datasets in the VDR format:
- ibm-research/REAL-MM-RAG_TechSlides
- ibm-research/REAL-MM-RAG_TechReport
- ibm-research/REAL-MM-RAG_FinTabTrainSet
- ibm-research/REAL-MM-RAG_FinTabTrainSet_rephrased
- ibm-research/REAL-MM-RAG_FinSlides
- ibm-research/REAL-MM-RAG_FinReport
Rows with glitched or absent query or image were filtered out.
## Dataset Curators
- **Léo Appourchaux**
# VDR_ibm-research_REAL-MM-RAG —— 概述
## 数据集概览
**VDR_ibm-research_REAL-MM-RAG 是一款融合文本与图像数据的多模态数据集,可支持包括DSE检索在内的检索增强生成(Retrieval-Augmented Generation,RAG)相关任务。**
## 数据集构建
本数据集通过对以下采用VDR格式的数据集进行合并与洗牌操作得到:
- ibm-research/REAL-MM-RAG_TechSlides
- ibm-research/REAL-MM-RAG_TechReport
- ibm-research/REAL-MM-RAG_FinTabTrainSet
- ibm-research/REAL-MM-RAG_FinTabTrainSet_rephrased
- ibm-research/REAL-MM-RAG_FinSlides
- ibm-research/REAL-MM-RAG_FinReport
已过滤掉存在查询语句异常、缺失或图像缺失的样本行。
## 数据集维护者
- **Léo Appourchaux**
提供机构:
maas
创建时间:
2025-08-24



