mteb/JinaVDRTweetStockSyntheticsRetrieval
收藏Hugging Face2025-10-21 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/mteb/JinaVDRTweetStockSyntheticsRetrieval
下载链接
链接失效反馈官方服务:
资源简介:
JinaVDRTweetStockSyntheticsRetrieval是一个多语言数据集,用于视觉文档检索、图像到文本和文本到图像任务。该数据集包含图像和文本数据,支持阿拉伯语、德语、英语、法语、印地语、匈牙利语、日语、俄语、西班牙语和中文。数据集分为测试集,用于评估模型。该数据集来源于jinaai/tweet-stock-synthetic-retrieval_beir数据集,是MTEB基准的一部分。数据集支持多语言,并包含图像和文本数据。
JinaVDRTweetStockSyntheticsRetrieval is a multilingual dataset designed for visual-document retrieval, image-to-text, and text-to-image tasks. The dataset includes image and text data, supporting languages such as Arabic, German, English, French, Hindi, Hungarian, Japanese, Russian, Spanish, and Chinese. It is split into test splits for evaluation purposes. The dataset is derived from the jinaai/tweet-stock-synthetic-retrieval_beir dataset, which is part of the MTEB benchmark. It is multilingual and includes both image and text data.
提供机构:
mteb



