ziiio/CommonSketch

Name: ziiio/CommonSketch
Creator: ziiio
Published: 2026-04-20 08:51:41
License: 暂无描述

Hugging Face2026-04-20 更新2026-04-26 收录

下载链接：

https://hf-mirror.com/datasets/ziiio/CommonSketch

下载链接

链接失效反馈

官方服务：

资源简介：

--- pretty_name: CommonSketch license: cc-by-4.0 task_categories: - image-classification - image-to-text - visual-question-answering language: - en tags: - image - sketch - captions - visual-question-answering - commonsense - abstraction - computer-vision size_categories: - 10K<n<100K --- # CommonSketch ## Dataset Summary CommonSketch is a semantically annotated sketch dataset introduced in the paper [SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering](https://arxiv.org/abs/2603.28363). The dataset contains 23,100 human-drawn sketches across 300 object classes. Each sketch is paired with a fine-grained caption and element-level commonsense annotations for evaluating sketch abstraction and semantic recognizability. ## Dataset Structure ```text CommonSketch/ data/ train-00000-of-00012.parquet ... metadata.csv captions.csv vqa_element_annotation.json metadata/ classes.csv classes.txt commonsense_elements.json ``` ## Data Fields The main row-level metadata is provided in `metadata.csv`. | Field | Description | | --- | --- | | `file_name` | Relative path to the sketch image. | | `caption` | Fine-grained caption describing the sketch. | | `class_name` | Object class name. | | `class_id` | Class identifier from `001` to `300`. | | `category` | High-level category for the class. | The Parquet files under `data/` provide the dataset examples for direct loading with the Hugging Face `datasets` library. Each row contains the following fields: | Field | Description | | --- | --- | | `image` | Sketch image embedded in the dataset. | | `file_name` | Original relative image path. | | `caption` | Fine-grained caption describing the sketch. | | `class_name` | Object class name. | | `class_id` | Class identifier from `001` to `300`. | | `category` | High-level category for the class. | | `element_annotation` | JSON-serialized binary element annotation for the sketch. | The `captions.csv` file provides the image-caption pairs with the following fields. | Field | Description | | --- | --- | | `image` | Image file name. | | `caption` | Fine-grained caption describing the sketch. | ## Annotation Files `vqa_element_annotation.json` contains image-level binary element annotations. Each annotation indicates whether a class-specific commonsense element is present in the sketch. `metadata/commonsense_elements.json` defines the class-level commonsense element schema used by the VQA annotations. Each class entry includes its `class_id`, `total_elements`, and the list of element definitions. `metadata/classes.csv` provides the 300 class names, class IDs, high-level categories, and the number of commonsense elements per class. `metadata/classes.txt` provides the class list only. ## Category Statistics | Category | # Classes | | --- | ---: | | animal | 61 | | body part | 7 | | clothing | 8 | | container | 9 | | electronic device | 22 | | food | 28 | | furniture | 25 | | icon | 13 | | musical instrument | 11 | | nature | 14 | | sports equipment | 14 | | structure | 28 | | tool | 37 | | vehicle | 23 | ## Usage After upload to the Hugging Face Hub, the dataset can be loaded with: ```python from datasets import load_dataset dataset = load_dataset("ziiio/CommonSketch") ``` The `image` field contains the sketch image, and `element_annotation` contains the image-level element annotation as a JSON string. ## License CommonSketch is released under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. If you use CommonSketch, please cite the SEA paper. ## Citation ```bibtex @article{park2026sea, title={SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering}, author={Park, Jiho and Choi, Sieun and Seo, Jaeyoon and Sohn, Minho and Kim, Yeana and Kim, Jihie}, journal={arXiv preprint arXiv:2603.28363}, year={2026} } ``` ## Links - Paper: https://arxiv.org/abs/2603.28363 - PDF: https://arxiv.org/pdf/2603.28363 - Code: https://github.com/zihos/SEA - Project page: https://zihos.github.io/SEA ## Note CommonSketch includes a subset of the SketchDUO dataset. For more information about SketchDUO, please refer to https://huggingface.co/datasets/ziiio/SketchDUO.

数据集名称：CommonSketch 许可证：CC BY 4.0 任务类别： - 图像分类（image-classification） - 图像转文本（image-to-text） - 视觉问答（visual-question-answering）语言： - 英语标签： - 图像（image） - 草图（sketch） - 标题（captions） - 视觉问答（visual-question-answering） - 常识（commonsense） - 抽象性（abstraction） - 计算机视觉（computer-vision）样本规模：10K<n<100K # 通用草图数据集（CommonSketch） ## 数据集概览通用草图数据集（CommonSketch）是一篇发表于论文《SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering》（https://arxiv.org/abs/2603.28363）的语义标注草图数据集。本数据集包含覆盖300个目标类别的23100幅人工绘制草图。每幅草图均配有细粒度标题与元素级常识标注，用于评估草图的抽象性与语义可识别性。 ## 数据集结构 text CommonSketch/ data/ train-00000-of-00012.parquet ... metadata.csv captions.csv vqa_element_annotation.json metadata/ classes.csv classes.txt commonsense_elements.json ## 数据字段行级元数据主要存储于`metadata.csv`中，字段说明如下： | 字段 | 说明 | | --- | --- | | `file_name` | 指向草图图像的相对路径 | | `caption` | 描述草图的细粒度标题 | | `class_name` | 目标类别名称 | | `class_id` | 类别标识符，取值范围为001至300 | | `category` | 该类别的高级类别归属 | `data/`目录下的Parquet文件可通过Hugging Face `datasets`库直接加载，其中每行包含以下字段： | 字段 | 说明 | | --- | --- | | `image` | 嵌入数据集的草图图像 | | `file_name` | 原始图像相对路径 | | `caption` | 描述草图的细粒度标题 | | `class_name` | 目标类别名称 | | `class_id` | 类别标识符，取值范围为001至300 | | `category` | 该类别的高级类别归属 | | `element_annotation` | 用于描述草图的JSON序列化二值化元素标注 | `captions.csv`文件存储图像-标题配对数据，字段说明如下： | 字段 | 说明 | | --- | --- | | `image` | 图像文件名 | | `caption` | 描述草图的细粒度标题 | ## 标注文件 `vqa_element_annotation.json`包含图像级二值化元素标注，每条标注用于指示草图中是否存在对应类别的常识元素。 `metadata/commonsense_elements.json`定义了VQA标注所使用的类别级常识元素模式，每个类别条目包含其`class_id`、总元素数（`total_elements`）以及元素定义列表。 `metadata/classes.csv`提供300个类别的名称、类别ID、高级类别归属以及每个类别的常识元素数量；`metadata/classes.txt`仅存储类别列表。 ## 类别统计 | 类别 | 类别数量 | | --- | ---: | | 动物 | 61 | | 身体部位 | 7 | | 服饰 | 8 | | 容器 | 9 | | 电子设备 | 22 | | 食品 | 28 | | 家具 | 25 | | 图标 | 13 | | 乐器 | 11 | | 自然事物 | 14 | | 体育器材 | 14 | | 建筑结构 | 28 | | 工具 | 37 | | 交通工具 | 23 | ## 使用方法本数据集上传至Hugging Face Hub后，可通过以下代码加载： python from datasets import load_dataset dataset = load_dataset("ziiio/CommonSketch") 其中`image`字段存储草图图像，`element_annotation`字段存储以JSON字符串形式呈现的图像级元素标注。 ## 许可证通用草图数据集（CommonSketch）采用知识共享署名4.0国际版（CC BY 4.0）协议发布。若使用本数据集，请引用SEA相关论文。 ## 引用格式 bibtex @article{park2026sea, title={SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering}, author={Park, Jiho and Choi, Sieun and Seo, Jaeyoon and Sohn, Minho and Kim, Yeana and Kim, Jihie}, journal={arXiv preprint arXiv:2603.28363}, year={2026} } ## 相关链接 - 论文：https://arxiv.org/abs/2603.28363 - PDF全文：https://arxiv.org/pdf/2603.28363 - 代码：https://github.com/zihos/SEA - 项目主页：https://zihos.github.io/SEA ## 说明通用草图数据集（CommonSketch）包含SketchDUO数据集的子集。如需了解SketchDUO的更多信息，请访问https://huggingface.co/datasets/ziiio/SketchDUO。

提供机构：

ziiio

5,000+

优质数据集

54 个

任务类型

进入经典数据集