five

CyberHarem/akatsuki_azurlane

收藏
Hugging Face2024-01-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/CyberHarem/akatsuki_azurlane
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-to-image tags: - art - not-for-all-audiences size_categories: - n<1K --- # Dataset of akatsuki/暁/晓 (Azur Lane) This is the dataset of akatsuki/暁/晓 (Azur Lane), containing 17 images and their tags. The core tags of this character are `black_hair, long_hair, ponytail, bangs, red_eyes, hair_between_eyes, breasts, eyepatch, high_ponytail, horns`, which are pruned in this dataset. Images are crawled from many sites (e.g. danbooru, pixiv, zerochan ...), the auto-crawling system is powered by [DeepGHS Team](https://github.com/deepghs)([huggingface organization](https://huggingface.co/deepghs)). ## List of Packages | Name | Images | Size | Download | Type | Description | |:-----------------|---------:|:----------|:-------------------------------------------------------------------------------------------------------------------|:-----------|:---------------------------------------------------------------------| | raw | 17 | 13.39 MiB | [Download](https://huggingface.co/datasets/CyberHarem/akatsuki_azurlane/resolve/main/dataset-raw.zip) | Waifuc-Raw | Raw data with meta information (min edge aligned to 1400 if larger). | | 800 | 17 | 10.79 MiB | [Download](https://huggingface.co/datasets/CyberHarem/akatsuki_azurlane/resolve/main/dataset-800.zip) | IMG+TXT | dataset with the shorter side not exceeding 800 pixels. | | stage3-p480-800 | 40 | 19.30 MiB | [Download](https://huggingface.co/datasets/CyberHarem/akatsuki_azurlane/resolve/main/dataset-stage3-p480-800.zip) | IMG+TXT | 3-stage cropped dataset with the area not less than 480x480 pixels. | | 1200 | 17 | 12.87 MiB | [Download](https://huggingface.co/datasets/CyberHarem/akatsuki_azurlane/resolve/main/dataset-1200.zip) | IMG+TXT | dataset with the shorter side not exceeding 1200 pixels. | | stage3-p480-1200 | 40 | 22.53 MiB | [Download](https://huggingface.co/datasets/CyberHarem/akatsuki_azurlane/resolve/main/dataset-stage3-p480-1200.zip) | IMG+TXT | 3-stage cropped dataset with the area not less than 480x480 pixels. | ### Load Raw Dataset with Waifuc We provide raw dataset (including tagged images) for [waifuc](https://deepghs.github.io/waifuc/main/tutorials/installation/index.html) loading. If you need this, just run the following code ```python import os import zipfile from huggingface_hub import hf_hub_download from waifuc.source import LocalSource # download raw archive file zip_file = hf_hub_download( repo_id='CyberHarem/akatsuki_azurlane', repo_type='dataset', filename='dataset-raw.zip', ) # extract files to your directory dataset_dir = 'dataset_dir' os.makedirs(dataset_dir, exist_ok=True) with zipfile.ZipFile(zip_file, 'r') as zf: zf.extractall(dataset_dir) # load the dataset with waifuc source = LocalSource(dataset_dir) for item in source: print(item.image, item.meta['filename'], item.meta['tags']) ``` ## List of Clusters List of tag clustering result, maybe some outfits can be mined here. ### Raw Text Version | # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | Tags | |----:|----------:|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------| | 0 | 13 | ![](samples/0/clu0-sample0.png) | ![](samples/0/clu0-sample1.png) | ![](samples/0/clu0-sample2.png) | ![](samples/0/clu0-sample3.png) | ![](samples/0/clu0-sample4.png) | 1girl, looking_at_viewer, solo, mask, scarf, simple_background, white_background, elbow_gloves, fingerless_gloves, full_body, midriff, ninja, weapon | ### Table Version | # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | 1girl | looking_at_viewer | solo | mask | scarf | simple_background | white_background | elbow_gloves | fingerless_gloves | full_body | midriff | ninja | weapon | |----:|----------:|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------|:--------------------|:-------|:-------|:--------|:--------------------|:-------------------|:---------------|:--------------------|:------------|:----------|:--------|:---------| | 0 | 13 | ![](samples/0/clu0-sample0.png) | ![](samples/0/clu0-sample1.png) | ![](samples/0/clu0-sample2.png) | ![](samples/0/clu0-sample3.png) | ![](samples/0/clu0-sample4.png) | X | X | X | X | X | X | X | X | X | X | X | X | X |

This is the dataset of akatsuki/暁/晓 (Azur Lane), containing 17 images and their tags. The tags detail the characters features, such as black hair, long hair, ponytail, bangs, red eyes, etc. The dataset offers multiple versions for download, each with images of different sizes and processing methods. Additionally, the dataset includes tag clustering results, which may help in mining different outfits of the character.
提供机构:
CyberHarem
原始信息汇总

数据集概述

数据集信息

  • 名称: Dataset of akatsuki/暁/晓 (Azur Lane)
  • 许可证: MIT
  • 任务类别: text-to-image
  • 标签: art, not-for-all-audiences
  • 大小类别: n<1K

数据集内容

  • 图像数量: 17张
  • 核心标签: black_hair, long_hair, ponytail, bangs, red_eyes, hair_between_eyes, breasts, eyepatch, high_ponytail, horns
  • 来源: 从多个网站爬取,如danbooru, pixiv, zerochan等

数据集包列表

名称 图像数量 大小 类型 描述
raw 17 13.39 MiB Waifuc-Raw 原始数据,包含元信息(最小边对齐到1400像素,如果更大)。
800 17 10.79 MiB IMG+TXT 短边不超过800像素的数据集。
stage3-p480-800 40 19.30 MiB IMG+TXT 3阶段裁剪数据集,区域不小于480x480像素。
1200 17 12.87 MiB IMG+TXT 短边不超过1200像素的数据集。
stage3-p480-1200 40 22.53 MiB IMG+TXT 3阶段裁剪数据集,区域不小于480x480像素。

数据集加载

  • 加载工具: waifuc

  • 加载代码: python import os import zipfile from huggingface_hub import hf_hub_download from waifuc.source import LocalSource

    下载原始归档文件

    zip_file = hf_hub_download( repo_id=CyberHarem/akatsuki_azurlane, repo_type=dataset, filename=dataset-raw.zip, )

    提取文件到指定目录

    dataset_dir = dataset_dir os.makedirs(dataset_dir, exist_ok=True) with zipfile.ZipFile(zip_file, r) as zf: zf.extractall(dataset_dir)

    使用waifuc加载数据集

    source = LocalSource(dataset_dir) for item in source: print(item.image, item.meta[filename], item.meta[tags])

标签聚类结果

原始文本版本

# 样本数量 Img-1 Img-2 Img-3 Img-4 Img-5 标签
0 13 1girl, looking_at_viewer, solo, mask, scarf, simple_background, white_background, elbow_gloves, fingerless_gloves, full_body, midriff, ninja, weapon

表格版本

# 样本数量 Img-1 Img-2 Img-3 Img-4 Img-5 1girl looking_at_viewer solo mask scarf simple_background white_background elbow_gloves fingerless_gloves full_body midriff ninja weapon
0 13 X X X X X X X X X X X X X
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作