recursal/OKReddit-Visionary
收藏Hugging Face2024-12-03 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/recursal/OKReddit-Visionary
下载链接
链接失效反馈官方服务:
资源简介:
OKReddit Visionary是一个包含约74K对图像问答对的数据集,主要用于研究和存档目的。该数据集由KaraKaraWitch整理,由Recursal.ai资助,并主要使用英语。数据集支持多种自然语言处理任务,如视觉问答和文本到图像(反之亦然)。数据集的结构允许使用webdataset加载,并包含多种图像格式。数据集的创建基于Reddit的特定子版块,如PeterExplainsTheJoke等,通过筛选高评分帖子和回复来确保质量。
OKReddit Visionary is a collection of approximately 74K pairs of image Question & Answers, primarily intended for research or archival purposes. The dataset was curated by KaraKaraWitch, funded by Recursal.ai, and mainly uses English. It supports a variety of natural language processing tasks, including visual questioning and text to image (and vice versa). The dataset structure allows loading with webdataset and includes multiple image formats. The creation of the dataset is based on specific Reddit subreddits, such as PeterExplainsTheJoke, ensuring quality by filtering high-scoring threads and replies.
提供机构:
recursal



