five

CLIP-Embedded RedCaps Text-Image Dataset

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13137119
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset was created by applying the CLIP embedding to the RedCaps dataset. Queries are generated by OpenAI's GPT model simulating textual queries searching multimodal content, embedded via CLIP. The data was curated by Desai, Kaul, Aysola, and Johnson from data collected by Reddit, and further curated into vector data by Engels for this work.  Usage of the dataset itself is subject to Reddit terms, Reddit User Agreeement, Content Policy, and Privacy Policy (quoted from Desai et. al.'s accompanying paper for the image-and-text dataset). Usage of the queries are subject to OpenAI terms. Among others, OpenAI terms prohibits using the query component of this dataset to develop models that compete against OpenAI.
创建时间:
2024-07-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作