weiwu-ww/Recap-Long-Laion
收藏Hugging Face2024-11-25 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/weiwu-ww/Recap-Long-Laion
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含来自LAION-5B数据集的约4900万张图像的长描述。这些长描述是由预训练的多模态大语言模型(如ShareGPT4V、InstructBLIP、LLava1.5)生成的,使用的文本提示是“详细描述图像”。数据集的语言为英语,大小类别在10M到100M之间。数据集的许可证为CC-BY-4.0,图像链接与长描述一起分发,但单个图像的版权归各自所有。
This dataset consists of long captions of ~49M images from LAION-5B dataset. The long captions are generated by pre-trained Multi-modality Large Language Models (ShareGPT4V/InstructBLIP/LLava1.5) with the text prompt Describe the image in detail. The dataset is in English and falls under the size category of 10M to 100M. The dataset is distributed under a CC-BY-4.0 license, with image URLs and long captions, but individual images are under their own copyrights.
提供机构:
weiwu-ww



