[Sample] V-CAPE: Visual Consistency Assessment for Product Image Editing

Name: [Sample] V-CAPE: Visual Consistency Assessment for Product Image Editing
Creator: Zenodo
Published: 2026-05-02 14:34:01
License: 暂无描述

DataCite Commons2026-05-02 更新2026-05-07 收录

下载链接：

https://zenodo.org/doi/10.5281/zenodo.19979739

下载链接

链接失效反馈

官方服务：

资源简介：

V-CAPE Data Sample A small stratified sample from the V-CAPE benchmark, intended for anonymous reviewer quality inspection. The sample contains 3 examples per class (label × rejection reason) from each of the two dataset splits: Split Description Classes Samples vcape-r Real e-commerce editing pairs accepted, Geometry Artifacts, Wrong Orientation, Irrelevant Objects, Texture/Lighting/Color Issues, Background Issues, Generated-Main Image Mismatch 21 vcape-s Synthetic pairs rendered from 3D assets accepted, wrong_orientation, wrong_product_different_pt, wrong_product_same_pt 12 Files File Description data-00000-of-00001.arrow HuggingFace dataset in Arrow format containing all 33 samples dataset_info.json HuggingFace dataset metadata (features, num rows) state.json HuggingFace dataset state file croissant-sample.json Croissant metadata describing the dataset schema image_downloader.py Standalone script to download vcape-r source images by physical_id visualize_datasample.py Generates a self-contained HTML report for visual inspection report.html Pre-generated HTML visualization with the data sample (open in any browser) Dataset Schema Each row contains the following columns: Column Type Description split string Origin split: `vcape-r` or `vcape-s` physical_id string Source image identifier for vcape-r rows (empty for vcape-s) xsource_image Image Embedded source image for vcape-s rows (null for vcape-r) xtarget_image Image Generated/rendered target image object_description string Natural-language description of the product product_type string Product category (e.g., "sofa") pose string Target orientation / pose label input_prompt string Editing prompt for the transformation label string Human quality label: `accepted` / `rejected` (vcape-r) or `accept` / `reject` (vcape-s) rejection_reason string Structured rejection reason from a predefined taxonomy Quick Start 1. Install dependencies ```bash pip install datasets Pillow requests ``` 2. Load the dataset ```python from datasets import load_from_disk ds = load_from_disk(".") # run from this directory print(ds) print(ds[0]) ``` 3. Download vcape-r source images ```bash # Download all vcape-r source images into source_images/ python image_downloader.py # Or download specific images python image_downloader.py --pids 91FmmLIE3GL 71tLzgZoBQL ``` 4. Visualize the sample Either open the pre-generated `report.html` in a browser, or regenerate it: ```bash python visualize_datasample.py ``` This produces a self-contained HTML file with image thumbnails, metadata, and filter controls for split, label, and rejection reason.

提供机构：

Zenodo

创建时间：

2026-05-02

5,000+

优质数据集

54 个

任务类型

进入经典数据集