five

qidouxiong619/dreamlip_long_captions

收藏
Hugging Face2024-09-23 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/qidouxiong619/dreamlip_long_captions
下载链接
链接失效反馈
官方服务:
资源简介:
DreamLIP-30M数据集包含约3000万张图像的详细长标注,这些标注是通过预训练的多模态大语言模型生成的,平均长度为247个字符。此外,还提供了通过提示生成的简短描述。数据集的主要用途是文本到图像和零样本分类任务,语言为英语,数据规模在1000万到1亿之间。数据集的创建者包括Kecheng Zheng等人,遵循CC-BY-4.0许可。数据集基于CC3M,并感谢InstructBLIP、ShareGPT4V和LLAVA的预训练模型。

DreamLIP-Long-Captions is a dataset consisting of ~30M image annotations, i.e. detailed long captions. In contrast with the curated style of other synthetic image caption annotations, DreamLIP-30M utilizes pre-trained Multi-modality Large Language Model to obtain detailed descriptions with an average length of 247. More precisely, the detailed descriptions are generated by asking the ShareGPT4V/InstructBLIP/LLava1.5 the question Describe the image in detail. Meanwhile, we also provide the generated short caption by prompting Describe the image in one sentence. The question of detailed long captions has little impact on the diversity of answers, so we can obtain comprehensive captions of each image.
提供机构:
qidouxiong619
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作