DataDepictQA

Name: DataDepictQA
Creator: maas
Published: 2026-05-08 19:18:31
License: 暂无描述

魔搭社区2026-05-08 更新2024-08-31 收录

下载链接：

https://modelscope.cn/datasets/zhiyuanyou/DataDepictQA

下载链接

链接失效反馈

官方服务：

资源简介：

# DataDepictQA Datasets of the papers in [DepictQA project](https://depictqa.github.io/): - DepictQA-Wild (DepictQA-v2): [paper](https://arxiv.org/abs/2405.18842) / [project page](https://depictqa.github.io/depictqa-wild/) / [code](https://github.com/XPixelGroup/DepictQA). Zhiyuan You, Jinjin Gu, Zheyuan Li, Xin Cai, Kaiwen Zhu, Chao Dong, Tianfan Xue, "Descriptive Image Quality Assessment in the Wild," arXiv preprint arXiv:2405.18842, 2024. - DepictQA-v1: [paper](https://arxiv.org/abs/2312.08962) / [project page](https://depictqa.github.io/depictqa-v1/) / [code](https://github.com/XPixelGroup/DepictQA). Zhiyuan You, Zheyuan Li, Jinjin Gu, Zhenfei Yin, Tianfan Xue, Chao Dong, "Depicting beyond scores: Advancing image quality assessment through multi-modal language models," ECCV, 2024. ## Dataset Overview - Training DepictQA-v2 requires: - KADIS700K - BAPPS - PIPAL - KADID10K - DetailDescriptionLAMM - Training DepictQA-v1 requires: - BAPPS - PIPAL - KADID10K - DetailDescriptionLAMM ## Dataset Construction **Source codes** for dataset construction are provided in [here](https://github.com/XPixelGroup/DepictQA/tree/main/build_datasets). Our datasets are constructed based on existing datasets. Therefore, some source images should be downloaded and re-arranged to construct the datasets. Bellow we provide a detailed instruction. ### KADIS700K After downloading, extract the images from _*.tar.gz_ files. The directory of `DataDepictQA/KADIS700K` should be as follows. The meanings of directory names can be found in **Abbreviations** section of our [source codes](https://github.com/XPixelGroup/DepictQA/tree/main/build_datasets) for dataset construction. ``` |-- DataDepictQA |-- KADIS700K |-- A_md_brief |-- A_md_detail |-- A_sd_brief |-- A_sd_detail |-- AB_md_detail |-- AB_sd_detail |-- metas_combine |-- ref_imgs_s224 |-- refA_md_brief |-- refA_md_detail |-- refA_sd_brief |-- refA_sd_detail |-- refAB_md_detail |-- refAB_sd_detail ``` ### BAPPS 1. Download the BAPPS dataset (**2AFC Train set** and **2AFC Val set**) from [here](https://github.com/richzhang/PerceptualSimilarity/blob/master/scripts/download_dataset.sh). 2. Place the downloaded images in `DataDepictQA/BAPPS` as follows. ``` |-- DataDepictQA |-- BAPPS |-- images |-- mbapps_test_refA_s64 |-- mbapps_test_refAB_s64 |-- twoafc_train (downloaded) |-- twoafc_val (downloaded) |-- resize_bapps.py |-- metas ``` 3. The downloaded images are 256 x 256 patches, which are resized from the original 64 x 64 patches. Resizing does not influence comparison results (_i.e._, Image A or Image B is better), but influences the detailed reasoning tasks since additional pixelation distortion is introduced. Therefore, we resize these images back to their original 64 x 64 resolution. ``` cd DataDepictQA/BAPPS/images python resize_bapps.py ``` 4. The constructed BAPPS directory should be as follows. ``` |-- DataDepictQA |-- BAPPS |-- images |-- mbapps_test_refA_s64 |-- mbapps_test_refAB_s64 |-- twoafc_train (downloaded) |-- twoafc_train_s64 (created by resize_bapps.py) |-- twoafc_val (downloaded) |-- twoafc_val_s64 (created by resize_bapps.py) |-- resize_bapps.py |-- metas ``` ### PIPAL 1. Download the PIPAL dataset (**train set**) from [here](https://github.com/HaomingCai/PIPAL-dataset). 2. Place the downloaded images in `DataDepictQA/PIPAL` as follows. ``` |-- DataDepictQA |-- PIPAL |-- images |-- Distortion_1 (downloaded) |-- Distortion_2 (downloaded) |-- Distortion_3 (downloaded) |-- Distortion_4 (downloaded) |-- Train_Ref (downloaded) |-- metas ``` ### KADID10K 1. Download the KADID10K dataset from [here](https://database.mmsp-kn.de/kadid-10k-database.html). 2. Place the downloaded images in `DataDepictQA/KADID10K` as follows. ``` |-- DataDepictQA |-- KADID10K |-- images/*.png (downloaded) |-- metas |-- metas_srcc_plcc_voting ``` ### CSIQ 1. Download the CSIQ dataset from [here](https://s2.smu.edu/~eclarson/csiq.html). 2. Place the downloaded images in `DataDepictQA/CSIQ` as follows. ``` |-- DataDepictQA |-- CSIQ |-- images |-- src_imgs (downloaded) |-- dst_imgs (downloaded) |-- metas |-- metas_srcc_plcc_voting ``` ### TID2013 1. Download the TID2013 dataset from [here](https://www.ponomarenko.info/tid2013.htm). 2. Place the downloaded images in `DataDepictQA/TID2013` as follows. ``` |-- DataDepictQA |-- TID2013 |-- images |-- reference_images (downloaded) |-- distorted_images (downloaded) |-- metas |-- metas_srcc_plcc_voting ``` ### DetailDescriptionLAMM 1. Download the LAMM Detailed Description dataset (**coco_images**) from [here](https://opendatalab.com/LAMM/LAMM/tree/main/raw/2D_Instruct). 2. Place the downloaded images in `DataDepictQA/DetailDescriptionLAMM` as follows. ``` |-- DataDepictQA |-- DetailDescriptionLAMM |-- coco_images/*.jpg (downloaded) |-- metas ```

# DataDepictQA 本数据集为[DepictQA项目](https://depictqa.github.io/)相关论文的配套数据集集合： - DepictQA-Wild（DepictQA-v2）：[论文](https://arxiv.org/abs/2405.18842) / [项目页面](https://depictqa.github.io/depictqa-wild/) / [代码](https://github.com/XPixelGroup/DepictQA)。游志远、谷津津、李泽远、蔡昕、朱凯文、董超、薛天凡，《野外场景下的描述性图像质量评估》，arXiv预印本 arXiv:2405.18842，2024年。 - DepictQA-v1：[论文](https://arxiv.org/abs/2312.08962) / [项目页面](https://depictqa.github.io/depictqa-v1/) / [代码](https://github.com/XPixelGroup/DepictQA)。游志远、李泽远、谷津津、尹振飞、薛天凡、董超，《超越分数：通过多模态大语言模型（Multi-modal Language Model）推进图像质量评估》，ECCV 2024。 ## 数据集概览 - 训练DepictQA-v2需以下列数据集为基础： - KADIS700K - BAPPS - PIPAL - KADID10K - DetailDescriptionLAMM - 训练DepictQA-v1需以下列数据集为基础： - BAPPS - PIPAL - KADID10K - DetailDescriptionLAMM ## 数据集构建 **源代码** 数据集构建的配套源代码已在[此处](https://github.com/XPixelGroup/DepictQA/tree/main/build_datasets)公开。本数据集基于现有数据集构建，因此需下载部分源图像并进行重新整理，方可完成数据集构建。下文将提供详细操作说明。 ### KADIS700K 下载完成后，从*.tar.gz文件中解压图像。`DataDepictQA/KADIS700K`的目录结构应如下所示。目录名称的含义可参见数据集构建源代码的[缩写说明部分](https://github.com/XPixelGroup/DepictQA/tree/main/build_datasets)。 |-- DataDepictQA |-- KADIS700K |-- A_md_brief |-- A_md_detail |-- A_sd_brief |-- A_sd_detail |-- AB_md_detail |-- AB_sd_detail |-- metas_combine |-- ref_imgs_s224 |-- refA_md_brief |-- refA_md_detail |-- refA_sd_brief |-- refA_sd_detail |-- refAB_md_detail |-- refAB_sd_detail ### BAPPS 1. 从[此处](https://github.com/richzhang/PerceptualSimilarity/blob/master/scripts/download_dataset.sh)下载BAPPS数据集（**2AFC（二择一强制选择，Two-Alternative Forced Choice）训练集**与**2AFC验证集**）。 2. 将下载的图像按如下结构放置在`DataDepictQA/BAPPS`目录中： |-- DataDepictQA |-- BAPPS |-- images |-- mbapps_test_refA_s64 |-- mbapps_test_refAB_s64 |-- twoafc_train (downloaded) |-- twoafc_val (downloaded) |-- resize_bapps.py |-- metas 3. 下载的图像为256×256像素块，由原始64×64像素块缩放而来。缩放操作不会影响对比结果（即图像A与图像B的优劣判断），但会对细节推理任务产生影响，因为缩放会引入额外的像素化失真。因此，我们将这些图像重新缩放至原始的64×64分辨率。 cd DataDepictQA/BAPPS/images python resize_bapps.py 4. 构建完成后的BAPPS目录结构应如下所示： |-- DataDepictQA |-- BAPPS |-- images |-- mbapps_test_refA_s64 |-- mbapps_test_refAB_s64 |-- twoafc_train (downloaded) |-- twoafc_train_s64 (created by resize_bapps.py) |-- twoafc_val (downloaded) |-- twoafc_val_s64 (created by resize_bapps.py) |-- resize_bapps.py |-- metas ### PIPAL 1. 从[此处](https://github.com/HaomingCai/PIPAL-dataset)下载PIPAL数据集（**训练集**）。 2. 将下载的图像按如下结构放置在`DataDepictQA/PIPAL`目录中： |-- DataDepictQA |-- PIPAL |-- images |-- Distortion_1 (downloaded) |-- Distortion_2 (downloaded) |-- Distortion_3 (downloaded) |-- Distortion_4 (downloaded) |-- Train_Ref (downloaded) |-- metas ### KADID10K 1. 从[此处](https://database.mmsp-kn.de/kadid-10k-database.html)下载KADID10K数据集。 2. 将下载的图像按如下结构放置在`DataDepictQA/KADID10K`目录中： |-- DataDepictQA |-- KADID10K |-- images/*.png (downloaded) |-- metas |-- metas_srcc_plcc_voting ### CSIQ 1. 从[此处](https://s2.smu.edu/~eclarson/csiq.html)下载CSIQ数据集。 2. 将下载的图像按如下结构放置在`DataDepictQA/CSIQ`目录中： |-- DataDepictQA |-- CSIQ |-- images |-- src_imgs (downloaded) |-- dst_imgs (downloaded) |-- metas |-- metas_srcc_plcc_voting ### TID2013 1. 从[此处](https://www.ponomarenko.info/tid2013.htm)下载TID2013数据集。 2. 将下载的图像按如下结构放置在`DataDepictQA/TID2013`目录中： |-- DataDepictQA |-- TID2013 |-- images |-- reference_images (downloaded) |-- distorted_images (downloaded) |-- metas |-- metas_srcc_plcc_voting ### DetailDescriptionLAMM 1. 从[此处](https://opendatalab.com/LAMM/LAMM/tree/main/raw/2D_Instruct)下载LAMM详细描述数据集（**coco_images**）。 2. 将下载的图像按如下结构放置在`DataDepictQA/DetailDescriptionLAMM`目录中： |-- DataDepictQA |-- DetailDescriptionLAMM |-- coco_images/*.jpg (downloaded) |-- metas

提供机构：

maas

创建时间：

2024-07-07

5,000+

优质数据集

54 个

任务类型

进入经典数据集