fancheng0919/Retrievatar
收藏Hugging Face2025-12-19 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/fancheng0919/Retrievatar
下载链接
链接失效反馈官方服务:
资源简介:
Retrievatar 是一个多模态数据集,旨在增强视觉语言模型的检索增强生成能力,特别关注虚构的动漫角色和现实世界的各界名人。该数据集包含 100,000 个样本,是从一个更大的合成图像-文本语料库中提取的。图像描述使用 Gemini-2.5-pro GA 模型生成,并通过 Gemini API 利用 Google 搜索进行落地。数据集支持多种语言(英语、中文、日语、德语),并反映了 2025 年 8 月的互联网状态。
Retrievatar is a multimodal dataset designed to enhance the retrieval-augmented generation capabilities of vision-language models, specifically focusing on fictional anime characters and real-world celebrities across various fields. This release represents a subset of 100,000 samples extracted from a significantly larger synthetic image-text corpus. The image captions were generated using the Gemini-2.5-pro GA model, leveraging Grounding with Google Search via the Gemini API. The dataset features multilingual captions (English, Chinese, Japanese, German) and reflects the state of the web as of August 2025.
提供机构:
fancheng0919



