rizavelioglu/fashionfail
收藏FashionFail 数据集
概述
FashionFail 数据集是由论文 "FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation" 提出的,包含 2,495 张高分辨率(2400x2400 像素)的电子商务网站产品图像。数据集分为训练集、验证集和测试集,分别包含 1,344、150 和 1,001 张图像。
数据集下载
数据集仅提供图像的 URL,而非直接共享图像文件。提供了一个脚本用于数据集构建,该脚本首先从 HuggingFace Datasets 获取注释文件,然后使用这些注释文件中的 URL 下载图像。
安装与使用
-
克隆仓库并安装: bash git clone https://github.com/rizavelioglu/fashionfail.git cd fashionfail pip install -e .
-
运行脚本构建数据集: bash python fashionfail/data/make_dataset.py
数据集将构建在
"~/.cache/fashionfail/"目录下,可选参数--save_dir可设置为指定目录。
注释格式
注释遵循 COCO 数据集 的格式,存储为 JSON 格式,结构如下: json { "info" : info, # dict: keys are shown below "licenses" : [license], # List[dict]: keys are shown below "categories" : [category], # List[dict]: keys are shown below "images" : [image], # List[dict]: keys are shown below "annotations" : [annotation], # List[dict]: keys are shown below }
info{ "year" : int, "version" : str, "description" : str, "contributor" : str, "url" : str, "date_created" : datetime, }
license{ "id" : int, "name" : str, "url" : str, }
category{ "id" : int, "name" : str, "supercategory" : str, }
image{ "id" : int, "file_name" : str, "height" : int, "width" : int, "license" : int, "original_url" : str, }
annotation{ "id" : int, "image_id" : int, "category_id" : int, "area" : int, "iscrowd" : int, # always 0 as instances represent a single object "bbox" : list[float], # [x,y,width,height] "segmentation" : str, # compressed RLE: {"size", (height, widht), "counts": str} }
许可证
该项目仅用于学术研究,不提供商业使用。图像和品牌归各自所有者所有。注释文件根据 Server Side Public License (SSPL) 授权。
引用
bibtex @inproceedings{velioglu2024fashionfail, author = {Velioglu, Riza and Chan, Robin and Hammer, Barbara}, title = {FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation}, journal = {IJCNN}, eprint = {2404.08582}, year = {2024}, }



