CLIP-Embedded RedCaps Text-Image Dataset
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13137119
下载链接
链接失效反馈官方服务:
资源简介:
This dataset was created by applying the CLIP embedding to the RedCaps dataset. Queries are generated by OpenAI's GPT model simulating textual queries searching multimodal content, embedded via CLIP. The data was curated by Desai, Kaul, Aysola, and Johnson from data collected by Reddit, and further curated into vector data by Engels for this work.
Usage of the dataset itself is subject to Reddit terms, Reddit User Agreeement, Content Policy, and Privacy Policy (quoted from Desai et. al.'s accompanying paper for the image-and-text dataset). Usage of the queries are subject to OpenAI terms. Among others, OpenAI terms prohibits using the query component of this dataset to develop models that compete against OpenAI.
创建时间:
2024-07-30



