CyberHarem/ceylon_arknights
收藏数据集概述
名称: Dataset of ceylon/セイロン/锡兰 (Arknights)
内容: 包含169张图像及其标签。
核心标签: long_hair, pink_hair, feather_hair, hair_bun, hat, white_headwear, yellow_eyes, bow, hat_bow, black_bow, very_long_hair
数据来源: 图像从多个网站爬取,如danbooru, pixiv, zerochan等,由DeepGHS Team提供技术支持。
数据集包详情
| 名称 | 图像数量 | 大小 | 类型 | 描述 |
|---|---|---|---|---|
| raw | 169 | 304.60 MiB | Waifuc-Raw | 原始数据,包含元信息,最小边对齐至1400像素(如果更大)。 |
| 1200 | 169 | 257.51 MiB | IMG+TXT | 数据集,短边不超过1200像素。 |
| stage3-p480-1200 | 422 | 495.11 MiB | IMG+TXT | 三阶段裁剪数据集,区域不小于480x480像素。 |
数据集加载示例
使用waifuc加载原始数据集的示例代码:
python import os import zipfile
from huggingface_hub import hf_hub_download from waifuc.source import LocalSource
下载原始存档文件
zip_file = hf_hub_download( repo_id=CyberHarem/ceylon_arknights, repo_type=dataset, filename=dataset-raw.zip, )
提取文件到目录
dataset_dir = dataset_dir os.makedirs(dataset_dir, exist_ok=True) with zipfile.ZipFile(zip_file, r) as zf: zf.extractall(dataset_dir)
使用waifuc加载数据集
source = LocalSource(dataset_dir) for item in source: print(item.image, item.meta[filename], item.meta[tags])
标签聚类结果
示例标签
| 聚类编号 | 样本数 | 示例图像及标签 |
|---|---|---|
| 0 | 20 | 1girl, blue_feathers, solo, white_gloves, holding_umbrella, looking_at_viewer, smile, blue_dress, long_sleeves, outdoors, white_umbrella, sky, white_shirt, cowboy_shot, off_shoulder, orange_eyes, day |
| 1 | 7 | 1girl, blue_dress, blue_feathers, orange_eyes, solo, white_gloves, long_sleeves, looking_at_viewer, simple_background, smile, white_background, black_footwear, full_body, holding_umbrella, standing, white_pantyhose, white_umbrella, frilled_dress, hand_up, high_heels, single_hair_bun |
| 2 | 14 | 1girl, solo, blue_feathers, looking_at_viewer, smile, simple_background, upper_body, white_gloves, white_background, white_shirt, closed_mouth, hand_up, parted_lips |
| 3 | 11 | sunglasses, 1girl, double_bun, eyewear_on_head, looking_at_viewer, solo, bare_shoulders, official_alternate_costume, short_shorts, cleavage, smile, white_shorts, belt, holding, navel, off_shoulder, blunt_bangs, blush, open_mouth, swimsuit, cowboy_shot, large_breasts, midriff, camisole, flower, food, hair_ornament, open_clothes, simple_background, sitting, white_background |



