undefined443/LAION-Art-recaption
收藏Hugging Face2026-03-31 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/undefined443/LAION-Art-recaption
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-4.0
task_categories:
- image-to-text
- text-to-image
language:
- en
tags:
- art
- recaption
- laion
- vision-language
size_categories:
- 1M<n<10M
configs:
- config_name: default
data_files:
- split: train
path: data.parquet
---
# LAION-Art Recaption
Recaptioned subset of the [LAION-Art](https://huggingface.co/datasets/laion/laion-art) dataset using Qwen2.5-VL-7B-Instruct.
## Dataset Description
This dataset contains detailed recaptions for LAION-Art images generated by a vision-language model. Only successfully recaptioned samples are included.
**Total samples: 1,410,704**
### Columns
| Column | Type | Description |
| ----------------- | ------- | --------------------------------- |
| `url` | string | Original image URL |
| `key` | string | Unique identifier for each sample |
| `width` | int32 | Image width |
| `height` | int32 | Image height |
| `aesthetic_score` | float32 | LAION aesthetic score |
| `caption` | string | Generated detailed caption |
### Recaption Model
- **Model**: [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct)
### Example
```
url: https://i.pinimg.com/236x/6d/41/13/6d41130ff19a9bf30023b2a4b14fdf6b.jpg
key: 000000012
width: 236
height: 300
aesthetic_score: 8
caption: A knitted cat toy sits on a table, ready to be played with.
```
## Usage
```python
from datasets import load_dataset
dataset = load_dataset("undefined443/LAION-Art-recaption")
```
## License
This dataset inherits the license from LAION-Art (CC-BY-4.0).
## Related
- [LAION-Art](https://huggingface.co/datasets/laion/laion-art) - Original dataset
- [Qwen2.5-VL](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct) - Vision-language model used for recaptioning
提供机构:
undefined443



