CyberHarem/sendai_kantaicollection
收藏数据集概述
数据集名称
Dataset of sendai/川内/川内 (Kantai Collection)
数据集描述
该数据集包含500张图像及其标签,主题为Kantai Collection中的sendai/川内/川内角色。核心标签包括two_side_up, brown_eyes, short_hair, brown_hair, hair_ornament, bangs, black_hair。
数据集来源
图像从多个网站(如danbooru, pixiv, zerochan等)爬取,爬虫系统由DeepGHS Team开发。
数据集包列表
| 名称 | 图像数量 | 大小 | 类型 | 描述 |
|---|---|---|---|---|
| raw | 500 | 567.54 MiB | Waifuc-Raw | 包含元信息的原始数据(最小边对齐到1400像素,如果更大)。 |
| 800 | 500 | 362.20 MiB | IMG+TXT | 短边不超过800像素的数据集。 |
| stage3-p480-800 | 1248 | 764.22 MiB | IMG+TXT | 3阶段裁剪数据集,区域不小于480x480像素。 |
| 1200 | 500 | 516.67 MiB | IMG+TXT | 短边不超过1200像素的数据集。 |
| stage3-p480-1200 | 1248 | 1005.00 MiB | IMG+TXT | 3阶段裁剪数据集,区域不小于480x480像素。 |
数据集加载
提供原始数据集(包括带标签的图像)用于waifuc加载。示例代码如下: python import os import zipfile from huggingface_hub import hf_hub_download from waifuc.source import LocalSource
下载原始归档文件
zip_file = hf_hub_download( repo_id=CyberHarem/sendai_kantaicollection, repo_type=dataset, filename=dataset-raw.zip, )
提取文件到指定目录
dataset_dir = dataset_dir os.makedirs(dataset_dir, exist_ok=True) with zipfile.ZipFile(zip_file, r) as zf: zf.extractall(dataset_dir)
使用waifuc加载数据集
source = LocalSource(dataset_dir) for item in source: print(item.image, item.meta[filename], item.meta[tags])
标签聚类结果
列出了标签聚类结果,可能有助于挖掘某些服装。
原始文本版本
| # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | Tags |
|---|---|---|---|---|---|---|---|
| 0 | 12 | ![]() |
![]() |
![]() |
![]() |
![]() |
1girl, elbow_gloves, solo, serafuku, smile, torpedo, white_scarf, looking_at_viewer, skirt, fingerless_gloves, single_thighhigh |
| 1 | 7 | ![]() |
![]() |
![]() |
![]() |
![]() |
1girl, double-breasted, elbow_gloves, serafuku, solo, fingerless_gloves, looking_at_viewer, smile, white_scarf, black_gloves, blush, skirt |
| 2 | 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
1girl, black_gloves, black_neckerchief, double-breasted, elbow_gloves, serafuku, smile, solo, upper_body, white_scarf, fingerless_gloves, looking_at_viewer, simple_background, white_background, blush, white_sailor_collar, hair_between_eyes, open_mouth |
| 3 | 6 | ![]() |
![]() |
![]() |
![]() |
![]() |
1girl, black_gloves, black_neckerchief, black_skirt, black_thighhighs, cowboy_shot, double-breasted, elbow_gloves, fingerless_gloves, looking_at_viewer, serafuku, single_thighhigh, solo, white_scarf, pleated_skirt, smile, open_mouth |
| 4 | 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
1girl, elbow_gloves, serafuku, solo, looking_at_viewer, open_mouth, :d, black_gloves, pleated_skirt, blush, neckerchief |
| 5 | 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
1girl, alternate_costume, looking_at_viewer, school_uniform, solo, :d, blush, hair_between_eyes, long_sleeves, open_mouth, pleated_skirt, red_bowtie, simple_background, twitter_username, white_scarf, black_jacket, blazer, blue_skirt, cardigan, cowboy_shot, grey_skirt, holding_food, open_clothes, plaid, white_background, white_shirt |
| 6 | 18 | ![]() |
![]() |
![]() |
![]() |
![]() |
1girl, alternate_costume, kimono, looking_at_viewer, solo, smile, hair_between_eyes, obi, wide_sleeves, blush, long_sleeves, floral_print, open_mouth |
表格版本
| # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | 1girl | elbow_gloves | solo | serafuku | smile | torpedo | white_scarf | looking_at_viewer | skirt | fingerless_gloves | single_thighhigh | double-breasted | black_gloves | blush | black_neckerchief | upper_body | simple_background | white_background | white_sailor_collar | hair_between_eyes | open_mouth | black_skirt | black_thighhighs | cowboy_shot | pleated_skirt | :d | neckerchief | alternate_costume | school_uniform | long_sleeves | red_bowtie | twitter_username | black_jacket | blazer | blue_skirt | cardigan | grey_skirt | holding_food | open_clothes | plaid | white_shirt | kimono | obi | wide_sleeves | floral_print |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 12 | ![]() |
![]() |
![]() |
![]() |
![]() |
X | X | X | X | X | X | X | X | X | X | X | ||||||||||||||||||||||||||||||||||
| 1 | 7 | ![]() |
![]() |
![]() |
![]() |
![]() |
X | X | X | X | X | X | X | X | X | X | X | X | |||||||||||||||||||||||||||||||||
| 2 | 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | |||||||||||||||||||||||||||
| 3 | 6 | ![]() |
![]() |
![]() |
![]() |
![]() |
X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | ||||||||||||||||||||||||||||
| 4 | 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
X | X | X | X | X | X | X | X | X | X | X | ||||||||||||||||||||||||||||||||||
| 5 | 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | |||||||||||||||||||
| 6 | 18 | ![]() |
![]() |
![]() |
![]() |
![]() |
X | X | X | X | X | X | X | X | X | X | X | X | X |






































