Bi8le/fashionpedia

Name: Bi8le/fashionpedia
Creator: Bi8le
Published: 2026-04-07 10:33:52
License: 暂无描述

Hugging Face2026-04-07 更新2026-04-12 收录

下载链接：

https://hf-mirror.com/datasets/Bi8le/fashionpedia

下载链接

链接失效反馈

官方服务：

资源简介：

--- pretty_name: Fashionpedia task_categories: - object-detection language: - en license: - cc-by-4.0 multilinguality: - monolingual size_categories: - 10K<n<100K source_datasets: - original tags: - object-detection - fashion - computer-vision paperswithcode_id: fashionpedia --- # Dataset Card for Fashionpedia ## Table of Contents - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Supported Tasks and Leaderboards](#supported-tasks-and-leaderboards) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Instances](#data-instances) - [Data Fields](#data-fields) - [Data Splits](#data-splits) - [Additional Information](#additional-information) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) - [Contributions](#contributions) ## Dataset Description - **Homepage:** https://fashionpedia.github.io/home/index.html - **Repository:** https://github.com/cvdfoundation/fashionpedia - **Paper:** https://arxiv.org/abs/2004.12276 ### Dataset Summary Fashionpedia is a dataset mapping out the visual aspects of the fashion world. From the paper: > Fashionpedia is a new dataset which consists of two parts: (1) an ontology built by fashion experts containing 27 main apparel categories, 19 apparel parts, 294 fine-grained attributes and their relationships; (2) a dataset with everyday and celebrity event fashion images annotated with segmentation masks and their associated per-mask fine-grained attributes, built upon the Fashionpedia ontology. Fashionpedia has: - 46781 images - 342182 bounding-boxes ### Supported Tasks - Object detection - Image classification ### Languages All of annotations use English as primary language. ## Dataset Structure The dataset is structured as follows: ```py DatasetDict({ train: Dataset({ features: ['image_id', 'image', 'width', 'height', 'objects'], num_rows: 45623 }) val: Dataset({ features: ['image_id', 'image', 'width', 'height', 'objects'], num_rows: 1158 }) }) ``` ### Data Instances An example of the data for one image is: ```py {'image_id': 23, 'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=682x1024>, 'width': 682, 'height': 1024, 'objects': {'bbox_id': [150311, 150312, 150313, 150314], 'category': [23, 23, 33, 10], 'bbox': [[445.0, 910.0, 505.0, 983.0], [239.0, 940.0, 284.0, 994.0], [298.0, 282.0, 386.0, 352.0], [210.0, 282.0, 448.0, 665.0]], 'area': [1422, 843, 373, 56375]}} ``` With the type of each field being defined as: ```py {'image_id': Value(dtype='int64'), 'image': Image(decode=True), 'width': Value(dtype='int64'), 'height': Value(dtype='int64'), 'objects': Sequence(feature={ 'bbox_id': Value(dtype='int64'), 'category': ClassLabel(num_classes=46, names=['shirt, blouse', 'top, t-shirt, sweatshirt', 'sweater', 'cardigan', 'jacket', 'vest', 'pants', 'shorts', 'skirt', 'coat', 'dress', 'jumpsuit', 'cape', 'glasses', 'hat', 'headband, head covering, hair accessory', 'tie', 'glove', 'watch', 'belt', 'leg warmer', 'tights, stockings', 'sock', 'shoe', 'bag, wallet', 'scarf', 'umbrella', 'hood', 'collar', 'lapel', 'epaulette', 'sleeve', 'pocket', 'neckline', 'buckle', 'zipper', 'applique', 'bead', 'bow', 'flower', 'fringe', 'ribbon', 'rivet', 'ruffle', 'sequin', 'tassel']), 'bbox': Sequence(feature=Value(dtype='float64'), length=4), 'area': Value(dtype='int64')}, length=-1)} ``` ### Data Fields The dataset has the following fields: - `image_id`: Unique numeric ID of the image. - `image`: A `PIL.Image.Image` object containing the image. Note that when accessing the image column: `dataset[0]["image"]` the image file is automatically decoded. Decoding of a large number of image files might take a significant amount of time. Thus it is important to first query the sample index before the `"image"` column, *i.e.* `dataset[0]["image"]` should **always** be preferred over `dataset["image"][0]` - `width`: Image width. - `height`: Image height. - `objects`: A dictionary containing bounding box metadata for the objects in the image: - `bbox_id`: Unique numeric ID of the bounding box annotation. - `category`: The object’s category. - `area`: The area of the bounding box. - `bbox`: The object’s bounding box (in the Pascal VOC format) ### Data Splits | | Train | Validation | Test | |----------------|--------|------------|------| | Images | 45623 | 1158 | 0 | | Bounding boxes | 333401 | 8781 | 0 | ## Additional Information ### Licensing Information Fashionpedia is licensed under a Creative Commons Attribution 4.0 International License. ### Citation Information ``` @inproceedings{jia2020fashionpedia, title={Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset}, author={Jia, Menglin and Shi, Mengyun and Sirotenko, Mikhail and Cui, Yin and Cardie, Claire and Hariharan, Bharath and Adam, Hartwig and Belongie, Serge} booktitle={European Conference on Computer Vision (ECCV)}, year={2020} } ``` ### Contributions Thanks to [@blinjrm](https://github.com/blinjrm) for adding this dataset.

--- pretty_name: 时尚百科（Fashionpedia） task_categories: - 目标检测（object-detection） language: - 英语（en） license: - 知识共享署名4.0（CC BY 4.0） multilinguality: - 单语言（monolingual） size_categories: - 10K<n<100K source_datasets: - 原创（original） tags: - 目标检测（object-detection） - 时尚（fashion） - 计算机视觉（computer-vision） paperswithcode_id: fashionpedia --- # 时尚百科（Fashionpedia）数据集卡片 ## 目录 - [数据集概述](#dataset-description) - [数据集摘要](#dataset-summary) - [支持任务与评测基准](#supported-tasks-and-leaderboards) - [语言](#languages) - [数据集结构](#dataset-structure) - [数据实例](#data-instances) - [数据字段](#data-fields) - [数据划分](#data-splits) - [附加信息](#additional-information) - [许可信息](#licensing-information) - [引用信息](#citation-information) - [贡献者](#contributions) ## 数据集概述 - **主页:** https://fashionpedia.github.io/home/index.html - **代码仓库:** https://github.com/cvdfoundation/fashionpedia - **论文:** https://arxiv.org/abs/2004.12276 ### 数据集摘要时尚百科（Fashionpedia）是一份涵盖时尚领域视觉要素的数据集。摘自原论文： > 时尚百科（Fashionpedia）为全新数据集，包含两部分内容：(1) 由时尚专家构建的本体库，涵盖27个主流服饰类别、19个服饰部件、294个细粒度属性及其相互关联；(2) 基于该本体库构建的日常与名人活动时尚图像数据集，图像已标注分割掩码及其对应的逐掩码细粒度属性。该数据集共包含： - 46781张图像 - 342182个边界框 ### 支持任务 - 目标检测（object-detection） - 图像分类（image classification） ### 语言所有标注均以英语作为主要语言。 ## 数据集结构该数据集的结构如下： python DatasetDict({ train: Dataset({ features: ['image_id', 'image', 'width', 'height', 'objects'], num_rows: 45623 }) val: Dataset({ features: ['image_id', 'image', 'width', 'height', 'objects'], num_rows: 1158 }) }) ### 数据实例单张图像的数据示例如下： python {'image_id': 23, 'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=682x1024>, 'width': 682, 'height': 1024, 'objects': {'bbox_id': [150311, 150312, 150313, 150314], 'category': [23, 23, 33, 10], 'bbox': [[445.0, 910.0, 505.0, 983.0], [239.0, 940.0, 284.0, 994.0], [298.0, 282.0, 386.0, 352.0], [210.0, 282.0, 448.0, 665.0]], 'area': [1422, 843, 373, 56375]}} 各字段的类型定义如下： python {'image_id': Value(dtype='int64'), 'image': Image(decode=True), 'width': Value(dtype='int64'), 'height': Value(dtype='int64'), 'objects': Sequence(feature={ 'bbox_id': Value(dtype='int64'), 'category': ClassLabel(num_classes=46, names=['shirt, blouse', 'top, t-shirt, sweatshirt', 'sweater', 'cardigan', 'jacket', 'vest', 'pants', 'shorts', 'skirt', 'coat', 'dress', 'jumpsuit', 'cape', 'glasses', 'hat', 'headband, head covering, hair accessory', 'tie', 'glove', 'watch', 'belt', 'leg warmer', 'tights, stockings', 'sock', 'shoe', 'bag, wallet', 'scarf', 'umbrella', 'hood', 'collar', 'lapel', 'epaulette', 'sleeve', 'pocket', 'neckline', 'buckle', 'zipper', 'applique', 'bead', 'bow', 'flower', 'fringe', 'ribbon', 'rivet', 'ruffle', 'sequin', 'tassel']), 'bbox': Sequence(feature=Value(dtype='float64'), length=4), 'area': Value(dtype='int64')}, length=-1)} ### 数据字段该数据集包含以下字段： - `image_id`: 图像的唯一数字标识符。 - `image`: 包含图像的`PIL.Image.Image`对象。请注意，当访问图像列时，`dataset[0]["image"]`会自动对图像文件进行解码。解码大量图像文件可能会耗费大量时间，因此建议优先通过样本索引查询图像列，即**始终使用`dataset[0]["image"]`而非`dataset["image"][0]`**。 - `width`: 图像宽度。 - `height`: 图像高度。 - `objects`: 包含图像中目标边界框元数据的字典： - `bbox_id`: 边界框标注的唯一数字标识符。 - `category`: 目标的类别。 - `area`: 边界框的面积。 - `bbox`: 目标的边界框（采用PASCAL VOC格式） ### 数据划分 | | 训练集 | 验证集 | 测试集 | |----------------|--------|--------|--------| | 图像数量 | 45623 | 1158 | 0 | | 边界框数量 | 333401 | 8781 | 0 | ## 附加信息 ### 许可信息时尚百科（Fashionpedia）采用知识共享署名4.0国际许可协议进行授权。 ### 引用信息 @inproceedings{jia2020fashionpedia, title={Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset}, author={Jia, Menglin and Shi, Mengyun and Sirotenko, Mikhail and Cui, Yin and Cardie, Claire and Hariharan, Bharath and Adam, Hartwig and Belongie, Serge} booktitle={European Conference on Computer Vision (ECCV)}, year={2020} } ### 贡献者感谢[@blinjrm](https://github.com/blinjrm)为本数据集的收录提供支持。

提供机构：

Bi8le

5,000+

优质数据集

54 个

任务类型

进入经典数据集