racineai/OGC_MEGA_MultiDomain_DocRetrieval
收藏Hugging Face2025-08-23 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/racineai/OGC_MEGA_MultiDomain_DocRetrieval
下载链接
链接失效反馈官方服务:
资源简介:
这是一个用于视觉文档检索模型训练的全面数据集,包含了来自不同领域的文档图像和对应的查询文本,支持多语言,包括英语、法语、德语、西班牙语和意大利语。数据集旨在通过提供正例和负例来增强模型的辨别能力,并优化训练效率。
This is a comprehensive dataset for training visual document retrieval models, including document images and corresponding query texts from different fields, supporting multiple languages such as English, French, German, Spanish, and Italian. The dataset is designed to enhance model discrimination capabilities by providing positive and negative examples, and to optimize training efficiency.
提供机构:
racineai



