opendiffusionai/laion2b-aesthetic-squareish-captions
收藏Hugging Face2025-11-14 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/opendiffusionai/laion2b-aesthetic-squareish-captions
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
size_categories:
- 100K<n<1M
---
This dataset contains image captions generated from [LAION2B-en-aesthetic-square](https://huggingface.co/opendiffusionai/lain2b-en-aesthetic-square).
We started with ~300K images after size filtering (2.5k max w/h), a portion of the images were skipped due to inaccessible URLs.<br>
The captions were generated over ~30 hours using [Qwen3-VL-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-30B-A3B-Instruct) on 1xH100 running SGLang with the prompt `Describe the content of the provided image in detail, in plaintext. Do not make assumptions. Do not use special formatting. Avoid purple prose.`

Total samples: 209,141<br>
Average caption length: 284.6 characters<br>
Median caption length: 231.0 characters
提供机构:
opendiffusionai



