five

vidore/vidore_v3_computer_science_mteb_format

收藏
Hugging Face2025-11-05 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/vidore/vidore_v3_computer_science_mteb_format
下载链接
链接失效反馈
官方服务:
资源简介:
这是一个多语言的文本到图像和图像到文本检索数据集,用于学术领域。数据集包含多种语言的数据,如英语、法语、德语、意大利语、葡萄牙语和西班牙语。数据集由Vidore v3计算机科学数据集派生而来。它包含不同的测试数据分割,包括图像和文本数据及其相应的ID。数据集遵循CC BY 4.0许可,并且是多语言的。它主要用于视觉文档检索任务。

This is a multilingual dataset for text-to-image and image-to-text retrieval tasks, specifically designed for the academic domain. The dataset includes data in multiple languages such as English, French, German, Italian, Portuguese, and Spanish. It is derived from the Vidore v3 Computer Science dataset. The dataset contains different types of test splits, including image and text data along with their corresponding IDs. The dataset is licensed under CC BY 4.0 and is multilingual, supporting multiple languages. It is primarily used for visual-document retrieval tasks.
提供机构:
vidore
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作