five

CIRCL/circl-ail-dataset-01-updated

收藏
Hugging Face2026-03-19 更新2026-04-05 收录
下载链接:
https://hf-mirror.com/datasets/CIRCL/circl-ail-dataset-01-updated
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 --- # circl-ail-dataset-01-updated This repository contains an updated and enriched version of the [original CIRCL Images AIL Dataset](https://www.circl.lu/opendata/circl-ail-dataset-01/). The updated dataset adds two kinds of derived metadata for each image: - **LLM-generated image descriptions** - **Open Alleged PhotoDNA fingerprints** These records are intended to support research, analysis, and experimentation on image classification, content understanding, and similarity matching. ## Dataset contents The repository includes processed JSON records derived from the original dataset. ### Open Alleged PhotoDNA fingerprints The following files contain the [Open Alleged PhotoDNA](https://github.com/adulau/open-alleged-photodna) fingerprints: - `folder_0.json` - `folder_1_SECONDPASS.json` These files store fingerprint data associated with the images from the original CIRCL Images AIL Dataset. ### LLM image descriptions Each image is associated with a JSON record containing a machine-generated description produced by a large language model (LLM). Each record includes: - the image filename - the model used - the prompt sent to the model - the generated response Example: ```json { "image": "enormous-ugly-used-mixture.png", "model": "qwen3-vl:32b", "prompt": "Describe the image", "response": "This image is a screenshot of the Spanish-language website ..." }
提供机构:
CIRCL
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作