five

ziiio/CommonSketch

收藏
Hugging Face2026-04-20 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/ziiio/CommonSketch
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: CommonSketch license: cc-by-4.0 task_categories: - image-classification - image-to-text - visual-question-answering language: - en tags: - image - sketch - captions - visual-question-answering - commonsense - abstraction - computer-vision size_categories: - 10K<n<100K --- # CommonSketch ## Dataset Summary CommonSketch is a semantically annotated sketch dataset introduced in the paper [SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering](https://arxiv.org/abs/2603.28363). The dataset contains 23,100 human-drawn sketches across 300 object classes. Each sketch is paired with a fine-grained caption and element-level commonsense annotations for evaluating sketch abstraction and semantic recognizability. ## Dataset Structure ```text CommonSketch/ data/ train-00000-of-00012.parquet ... metadata.csv captions.csv vqa_element_annotation.json metadata/ classes.csv classes.txt commonsense_elements.json ``` ## Data Fields The main row-level metadata is provided in `metadata.csv`. | Field | Description | | --- | --- | | `file_name` | Relative path to the sketch image. | | `caption` | Fine-grained caption describing the sketch. | | `class_name` | Object class name. | | `class_id` | Class identifier from `001` to `300`. | | `category` | High-level category for the class. | The Parquet files under `data/` provide the dataset examples for direct loading with the Hugging Face `datasets` library. Each row contains the following fields: | Field | Description | | --- | --- | | `image` | Sketch image embedded in the dataset. | | `file_name` | Original relative image path. | | `caption` | Fine-grained caption describing the sketch. | | `class_name` | Object class name. | | `class_id` | Class identifier from `001` to `300`. | | `category` | High-level category for the class. | | `element_annotation` | JSON-serialized binary element annotation for the sketch. | The `captions.csv` file provides the image-caption pairs with the following fields. | Field | Description | | --- | --- | | `image` | Image file name. | | `caption` | Fine-grained caption describing the sketch. | ## Annotation Files `vqa_element_annotation.json` contains image-level binary element annotations. Each annotation indicates whether a class-specific commonsense element is present in the sketch. `metadata/commonsense_elements.json` defines the class-level commonsense element schema used by the VQA annotations. Each class entry includes its `class_id`, `total_elements`, and the list of element definitions. `metadata/classes.csv` provides the 300 class names, class IDs, high-level categories, and the number of commonsense elements per class. `metadata/classes.txt` provides the class list only. ## Category Statistics | Category | # Classes | | --- | ---: | | animal | 61 | | body part | 7 | | clothing | 8 | | container | 9 | | electronic device | 22 | | food | 28 | | furniture | 25 | | icon | 13 | | musical instrument | 11 | | nature | 14 | | sports equipment | 14 | | structure | 28 | | tool | 37 | | vehicle | 23 | ## Usage After upload to the Hugging Face Hub, the dataset can be loaded with: ```python from datasets import load_dataset dataset = load_dataset("ziiio/CommonSketch") ``` The `image` field contains the sketch image, and `element_annotation` contains the image-level element annotation as a JSON string. ## License CommonSketch is released under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. If you use CommonSketch, please cite the SEA paper. ## Citation ```bibtex @article{park2026sea, title={SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering}, author={Park, Jiho and Choi, Sieun and Seo, Jaeyoon and Sohn, Minho and Kim, Yeana and Kim, Jihie}, journal={arXiv preprint arXiv:2603.28363}, year={2026} } ``` ## Links - Paper: https://arxiv.org/abs/2603.28363 - PDF: https://arxiv.org/pdf/2603.28363 - Code: https://github.com/zihos/SEA - Project page: https://zihos.github.io/SEA ## Note CommonSketch includes a subset of the SketchDUO dataset. For more information about SketchDUO, please refer to https://huggingface.co/datasets/ziiio/SketchDUO.

数据集名称:CommonSketch 许可证:CC BY 4.0 任务类别: - 图像分类(image-classification) - 图像转文本(image-to-text) - 视觉问答(visual-question-answering) 语言: - 英语 标签: - 图像(image) - 草图(sketch) - 标题(captions) - 视觉问答(visual-question-answering) - 常识(commonsense) - 抽象性(abstraction) - 计算机视觉(computer-vision) 样本规模:10K<n<100K # 通用草图数据集(CommonSketch) ## 数据集概览 通用草图数据集(CommonSketch)是一篇发表于论文《SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering》(https://arxiv.org/abs/2603.28363)的语义标注草图数据集。本数据集包含覆盖300个目标类别的23100幅人工绘制草图。每幅草图均配有细粒度标题与元素级常识标注,用于评估草图的抽象性与语义可识别性。 ## 数据集结构 text CommonSketch/ data/ train-00000-of-00012.parquet ... metadata.csv captions.csv vqa_element_annotation.json metadata/ classes.csv classes.txt commonsense_elements.json ## 数据字段 行级元数据主要存储于`metadata.csv`中,字段说明如下: | 字段 | 说明 | | --- | --- | | `file_name` | 指向草图图像的相对路径 | | `caption` | 描述草图的细粒度标题 | | `class_name` | 目标类别名称 | | `class_id` | 类别标识符,取值范围为001至300 | | `category` | 该类别的高级类别归属 | `data/`目录下的Parquet文件可通过Hugging Face `datasets`库直接加载,其中每行包含以下字段: | 字段 | 说明 | | --- | --- | | `image` | 嵌入数据集的草图图像 | | `file_name` | 原始图像相对路径 | | `caption` | 描述草图的细粒度标题 | | `class_name` | 目标类别名称 | | `class_id` | 类别标识符,取值范围为001至300 | | `category` | 该类别的高级类别归属 | | `element_annotation` | 用于描述草图的JSON序列化二值化元素标注 | `captions.csv`文件存储图像-标题配对数据,字段说明如下: | 字段 | 说明 | | --- | --- | | `image` | 图像文件名 | | `caption` | 描述草图的细粒度标题 | ## 标注文件 `vqa_element_annotation.json`包含图像级二值化元素标注,每条标注用于指示草图中是否存在对应类别的常识元素。 `metadata/commonsense_elements.json`定义了VQA标注所使用的类别级常识元素模式,每个类别条目包含其`class_id`、总元素数(`total_elements`)以及元素定义列表。 `metadata/classes.csv`提供300个类别的名称、类别ID、高级类别归属以及每个类别的常识元素数量;`metadata/classes.txt`仅存储类别列表。 ## 类别统计 | 类别 | 类别数量 | | --- | ---: | | 动物 | 61 | | 身体部位 | 7 | | 服饰 | 8 | | 容器 | 9 | | 电子设备 | 22 | | 食品 | 28 | | 家具 | 25 | | 图标 | 13 | | 乐器 | 11 | | 自然事物 | 14 | | 体育器材 | 14 | | 建筑结构 | 28 | | 工具 | 37 | | 交通工具 | 23 | ## 使用方法 本数据集上传至Hugging Face Hub后,可通过以下代码加载: python from datasets import load_dataset dataset = load_dataset("ziiio/CommonSketch") 其中`image`字段存储草图图像,`element_annotation`字段存储以JSON字符串形式呈现的图像级元素标注。 ## 许可证 通用草图数据集(CommonSketch)采用知识共享署名4.0国际版(CC BY 4.0)协议发布。若使用本数据集,请引用SEA相关论文。 ## 引用格式 bibtex @article{park2026sea, title={SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering}, author={Park, Jiho and Choi, Sieun and Seo, Jaeyoon and Sohn, Minho and Kim, Yeana and Kim, Jihie}, journal={arXiv preprint arXiv:2603.28363}, year={2026} } ## 相关链接 - 论文:https://arxiv.org/abs/2603.28363 - PDF全文:https://arxiv.org/pdf/2603.28363 - 代码:https://github.com/zihos/SEA - 项目主页:https://zihos.github.io/SEA ## 说明 通用草图数据集(CommonSketch)包含SketchDUO数据集的子集。如需了解SketchDUO的更多信息,请访问https://huggingface.co/datasets/ziiio/SketchDUO。
提供机构:
ziiio
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作