CyberHarem/aurora_arknights
收藏Hugging Face2024-03-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/CyberHarem/aurora_arknights
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- text-to-image
tags:
- art
- not-for-all-audiences
size_categories:
- n<1K
---
# Dataset of aurora/オーロラ/极光 (Arknights)
This is the dataset of aurora/オーロラ/极光 (Arknights), containing 434 images and their tags.
The core tags of this character are `animal_ears, bear_ears, blue_eyes, breasts, hairband, black_hairband, long_hair, hair_over_one_eye, large_breasts, white_hair, very_long_hair, grey_hair, eyes_visible_through_hair`, which are pruned in this dataset.
Images are crawled from many sites (e.g. danbooru, pixiv, zerochan ...), the auto-crawling system is powered by [DeepGHS Team](https://github.com/deepghs)([huggingface organization](https://huggingface.co/deepghs)).
## List of Packages
| Name | Images | Size | Download | Type | Description |
|:-----------------|---------:|:-----------|:------------------------------------------------------------------------------------------------------------------|:-----------|:---------------------------------------------------------------------|
| raw | 434 | 773.15 MiB | [Download](https://huggingface.co/datasets/CyberHarem/aurora_arknights/resolve/main/dataset-raw.zip) | Waifuc-Raw | Raw data with meta information (min edge aligned to 1400 if larger). |
| 1200 | 434 | 634.85 MiB | [Download](https://huggingface.co/datasets/CyberHarem/aurora_arknights/resolve/main/dataset-1200.zip) | IMG+TXT | dataset with the shorter side not exceeding 1200 pixels. |
| stage3-p480-1200 | 1105 | 1.24 GiB | [Download](https://huggingface.co/datasets/CyberHarem/aurora_arknights/resolve/main/dataset-stage3-p480-1200.zip) | IMG+TXT | 3-stage cropped dataset with the area not less than 480x480 pixels. |
### Load Raw Dataset with Waifuc
We provide raw dataset (including tagged images) for [waifuc](https://deepghs.github.io/waifuc/main/tutorials/installation/index.html) loading. If you need this, just run the following code
```python
import os
import zipfile
from huggingface_hub import hf_hub_download
from waifuc.source import LocalSource
# download raw archive file
zip_file = hf_hub_download(
repo_id='CyberHarem/aurora_arknights',
repo_type='dataset',
filename='dataset-raw.zip',
)
# extract files to your directory
dataset_dir = 'dataset_dir'
os.makedirs(dataset_dir, exist_ok=True)
with zipfile.ZipFile(zip_file, 'r') as zf:
zf.extractall(dataset_dir)
# load the dataset with waifuc
source = LocalSource(dataset_dir)
for item in source:
print(item.image, item.meta['filename'], item.meta['tags'])
```
## List of Clusters
List of tag clustering result, maybe some outfits can be mined here.
### Raw Text Version
| # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | Tags |
|----:|----------:|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 0 | 16 |  |  |  |  |  | 1girl, black_gloves, black_shirt, cowboy_shot, crop_top, cropped_jacket, long_sleeves, looking_at_viewer, midriff, navel, solo, stomach, white_jacket, grey_shorts, short_shorts, cleavage_cutout, simple_background, smile, pouch, standing, infection_monitor_(arknights), white_background, thighs |
| 1 | 5 |  |  |  |  |  | 1girl, black_gloves, black_shirt, blush, cowboy_shot, crop_top, cropped_jacket, grey_shorts, long_sleeves, looking_at_viewer, midriff, navel, short_shorts, simple_background, solo, stomach, white_jacket, cleavage_cutout, hairclip, pouch, standing, white_background, smile, parted_lips, shield |
| 2 | 19 |  |  |  |  |  | 1girl, crop_top, long_sleeves, solo, upper_body, black_gloves, cropped_jacket, looking_at_viewer, midriff, simple_background, white_jacket, black_shirt, navel, white_background, stomach, blush, cleavage_cutout, smile, hairclip |
| 3 | 6 |  |  |  |  |  | 1girl, cleavage_cutout, crop_top, cropped_jacket, hairclip, long_sleeves, looking_at_viewer, smile, solo, upper_body, white_jacket, black_gloves, black_shirt, simple_background, white_background, blush, closed_mouth, hand_up |
| 4 | 18 |  |  |  |  |  | 1girl, black_gloves, crop_top, long_sleeves, looking_at_viewer, midriff, navel, pouch, short_shorts, shrug_(clothing), solo, stomach, cleavage, cowboy_shot, standing, belt, black_shirt, thighs, simple_background, thigh_strap, grey_shorts, white_background, black_shorts, jacket, thighhighs, smile |
| 5 | 9 |  |  |  |  |  | 1girl, alternate_costume, long_sleeves, ribbed_sweater, smile, bear_girl, blush, cleavage_cutout, looking_at_viewer, simple_background, solo, turtleneck_sweater, white_background, grey_sweater, heart, open-chest_sweater, open_mouth, sleeves_past_wrists, white_sweater, bear_tail, closed_mouth, hairclip, upper_body |
| 6 | 10 |  |  |  |  |  | 1girl, goggles_on_head, long_sleeves, solo, coat, looking_at_viewer, official_alternate_costume, outdoors, black_gloves, black_jacket, open_jacket, snow, upper_body, parted_lips, sky, bodysuit, choker, signature |
| 7 | 8 |  |  |  |  |  | blush, navel, nipples, 1girl, looking_at_viewer, solo_focus, sweat, 1boy, bar_censor, bear_girl, collarbone, completely_nude, hetero, open_mouth, penis, sex, vaginal, cum_in_pussy, spread_legs, cowgirl_position, girl_on_top, pov, stomach, extra_ears, heart, on_back, on_bed, smile |
### Table Version
| # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | 1girl | black_gloves | black_shirt | cowboy_shot | crop_top | cropped_jacket | long_sleeves | looking_at_viewer | midriff | navel | solo | stomach | white_jacket | grey_shorts | short_shorts | cleavage_cutout | simple_background | smile | pouch | standing | infection_monitor_(arknights) | white_background | thighs | blush | hairclip | parted_lips | shield | upper_body | closed_mouth | hand_up | shrug_(clothing) | cleavage | belt | thigh_strap | black_shorts | jacket | thighhighs | alternate_costume | ribbed_sweater | bear_girl | turtleneck_sweater | grey_sweater | heart | open-chest_sweater | open_mouth | sleeves_past_wrists | white_sweater | bear_tail | goggles_on_head | coat | official_alternate_costume | outdoors | black_jacket | open_jacket | snow | sky | bodysuit | choker | signature | nipples | solo_focus | sweat | 1boy | bar_censor | collarbone | completely_nude | hetero | penis | sex | vaginal | cum_in_pussy | spread_legs | cowgirl_position | girl_on_top | pov | extra_ears | on_back | on_bed |
|----:|----------:|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------|:---------------|:--------------|:--------------|:-----------|:-----------------|:---------------|:--------------------|:----------|:--------|:-------|:----------|:---------------|:--------------|:---------------|:------------------|:--------------------|:--------|:--------|:-----------|:--------------------------------|:-------------------|:---------|:--------|:-----------|:--------------|:---------|:-------------|:---------------|:----------|:-------------------|:-----------|:-------|:--------------|:---------------|:---------|:-------------|:--------------------|:-----------------|:------------|:---------------------|:---------------|:--------|:---------------------|:-------------|:----------------------|:----------------|:------------|:------------------|:-------|:-----------------------------|:-----------|:---------------|:--------------|:-------|:------|:-----------|:---------|:------------|:----------|:-------------|:--------|:-------|:-------------|:-------------|:------------------|:---------|:--------|:------|:----------|:---------------|:--------------|:-------------------|:--------------|:------|:-------------|:----------|:---------|
| 0 | 16 |  |  |  |  |  | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 1 | 5 |  |  |  |  |  | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | | X | | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 2 | 19 |  |  |  |  |  | X | X | X | | X | X | X | X | X | X | X | X | X | | | X | X | X | | | | X | | X | X | | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 3 | 6 |  |  |  |  |  | X | X | X | | X | X | X | X | | | X | | X | | | X | X | X | | | | X | | X | X | | | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 4 | 18 |  |  |  |  |  | X | X | X | X | X | | X | X | X | X | X | X | | X | X | | X | X | X | X | | X | X | | | | | | | | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 5 | 9 |  |  |  |  |  | X | | | | | | X | X | | | X | | | | | X | X | X | | | | X | | X | X | | | X | X | | | | | | | | | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 6 | 10 |  |  |  |  |  | X | X | | | | | X | X | | | X | | | | | | | | | | | | | | | X | | X | | | | | | | | | | | | | | | | | | | | | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | |
| 7 | 8 |  |  |  |  |  | X | | | | | | | X | | X | | X | | | | | | X | | | | | | X | | | | | | | | | | | | | | | | X | | | X | | X | | | | | | | | | | | | | | | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X |
提供机构:
CyberHarem
原始信息汇总
数据集概述
数据集名称
- 名称: Dataset of aurora/オーロラ/极光 (Arknights)
数据集内容
- 包含: 434张图像及其标签
- 核心标签: animal_ears, bear_ears, blue_eyes, breasts, hairband, black_hairband, long_hair, hair_over_one_eye, large_breasts, white_hair, very_long_hair, grey_hair, eyes_visible_through_hair
数据集来源
- 来源: 从多个网站爬取,包括danbooru, pixiv, zerochan等
- 爬取系统: 由DeepGHS Team提供
数据集版本
原始数据
- 名称: raw
- 图像数量: 434
- 大小: 773.15 MiB
- 格式: Waifuc-Raw
- 描述: 包含元信息的原始数据,最小边对齐到1400像素(如果更大)
1200像素版本
- 名称: 1200
- 图像数量: 434
- 大小: 634.85 MiB
- 格式: IMG+TXT
- 描述: 短边不超过1200像素的图像数据集
阶段3-p480-1200版本
- 名称: stage3-p480-1200
- 图像数量: 1105
- 大小: 1.24 GiB
- 格式: IMG+TXT
- 描述: 三阶段裁剪数据集,区域不小于480x480像素
数据集使用
-
加载工具: 使用waifuc加载原始数据集
-
加载代码示例: python import os import zipfile from huggingface_hub import hf_hub_download from waifuc.source import LocalSource
zip_file = hf_hub_download( repo_id=CyberHarem/aurora_arknights, repo_type=dataset, filename=dataset-raw.zip, )
dataset_dir = dataset_dir os.makedirs(dataset_dir, exist_ok=True) with zipfile.ZipFile(zip_file, r) as zf: zf.extractall(dataset_dir)
source = LocalSource(dataset_dir) for item in source: print(item.image, item.meta[filename], item.meta[tags])
数据集标签集群
- 集群列表: 提供标签集群结果,可能包含可挖掘的服装信息
- 集群示例:
- 集群0: 包含16个样本,主要标签包括1girl, black_gloves, black_shirt等
- 集群1: 包含5个样本,主要标签包括1girl, black_gloves, black_shirt等
- 集群2: 包含19个样本,主要标签包括1girl, black_gloves, black_shirt等
- 集群3: 包含6个样本,主要标签包括1girl, black_gloves, black_shirt等
- 集群4: 包含18个样本,主要标签包括1girl, black_gloves, black_shirt等
- 集群5: 包含9个样本,主要标签包括1girl, black_gloves, black_shirt等
- 集群6: 包含10个样本,主要标签包括1girl, black_gloves, black_shirt等
- 集群7: 包含8个样本,主要标签包括1girl, black_gloves, black_shirt等



