vidore/vidore_v3_energy_mteb_format
收藏Hugging Face2025-11-05 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/vidore/vidore_v3_energy_mteb_format
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个配置,每个配置具有特定的特征和数据分割。配置包括不同的语言,如英语、法语、德语、意大利语、葡萄牙语和西班牙语。每个配置都有一个语料库、qrels(查询相关性判断)和查询。数据集旨在用于视觉文档检索、图像到文本和文本到图像等任务。数据集是MTEB(大规模文本嵌入基准)的一部分,用于评估嵌入模型。数据集是多种语言的,并包括注释创建者。数据集的许可证是CC BY 4.0。
The dataset consists of multiple configurations, each with specific features and data splits. The configurations include different languages such as English, French, German, Italian, Portuguese, and Spanish. Each configuration has a corpus, qrels (query relevance judgments), and queries. The dataset is designed for tasks such as visual-document retrieval, image-to-text, and text-to-image. The dataset is part of the MTEB (Massive Text Embedding Benchmark) and is used for evaluating embedding models. The dataset is multilingual and includes annotations creators. The license for the dataset is CC BY 4.0.
提供机构:
vidore



