five

opendiffusionai/laion2b-aesthetic-squareish-captions

收藏
Hugging Face2025-11-14 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/opendiffusionai/laion2b-aesthetic-squareish-captions
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en size_categories: - 100K<n<1M --- This dataset contains image captions generated from [LAION2B-en-aesthetic-square](https://huggingface.co/opendiffusionai/lain2b-en-aesthetic-square). We started with ~300K images after size filtering (2.5k max w/h), a portion of the images were skipped due to inaccessible URLs.<br> The captions were generated over ~30 hours using [Qwen3-VL-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-30B-A3B-Instruct) on 1xH100 running SGLang with the prompt `Describe the content of the provided image in detail, in plaintext. Do not make assumptions. Do not use special formatting. Avoid purple prose.` ![Text Length Distribution](assets/text_length_distribution.png) Total samples: 209,141<br> Average caption length: 284.6 characters<br> Median caption length: 231.0 characters
提供机构:
opendiffusionai
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作