
CyberHarem/rina_shioi_mahoushoujosite

Hugging Face, updated 2024-03-31, indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/CyberHarem/rina_shioi_mahoushoujosite
Resource description:
---
license: mit
task_categories:
- text-to-image
tags:
- art
- not-for-all-audiences
size_categories:
- n<1K
---

# Dataset of Rina Shioi/潮井梨ナ (Mahou Shoujo Site)

This is the dataset of Rina Shioi/潮井梨ナ (Mahou Shoujo Site), containing 245 images and their tags.

The core tags of this character are `pink_hair, glasses, twintails, scrunchie, long_hair, pink_eyes, hair_scrunchie, hair_ornament, low_twintails, black-framed_eyewear, breasts`, which are pruned in this dataset.

Images are crawled from many sites (e.g. danbooru, pixiv, zerochan ...). The auto-crawling system is powered by [DeepGHS Team](https://github.com/deepghs)([huggingface organization](https://huggingface.co/deepghs)).

## List of Packages

| Name | Images | Size | Download | Type | Description |
|:-----------------|---------:|:-----------|:---------|:-----------|:------------|
| raw | 245 | 152.72 MiB | [Download](https://huggingface.co/datasets/CyberHarem/rina_shioi_mahoushoujosite/resolve/main/dataset-raw.zip) | Waifuc-Raw | Raw data with meta information (min edge aligned to 1400 if larger). |
| 1200 | 245 | 152.62 MiB | [Download](https://huggingface.co/datasets/CyberHarem/rina_shioi_mahoushoujosite/resolve/main/dataset-1200.zip) | IMG+TXT | Dataset with the shorter side not exceeding 1200 pixels. |
| stage3-p480-1200 | 496 | 276.59 MiB | [Download](https://huggingface.co/datasets/CyberHarem/rina_shioi_mahoushoujosite/resolve/main/dataset-stage3-p480-1200.zip) | IMG+TXT | 3-stage cropped dataset with the area not less than 480x480 pixels. |

### Load Raw Dataset with Waifuc

We provide the raw dataset (including tagged images) for [waifuc](https://deepghs.github.io/waifuc/main/tutorials/installation/index.html) loading.
If you need this, just run the following code ```python import os import zipfile from huggingface_hub import hf_hub_download from waifuc.source import LocalSource # download raw archive file zip_file = hf_hub_download( repo_id='CyberHarem/rina_shioi_mahoushoujosite', repo_type='dataset', filename='dataset-raw.zip', ) # extract files to your directory dataset_dir = 'dataset_dir' os.makedirs(dataset_dir, exist_ok=True) with zipfile.ZipFile(zip_file, 'r') as zf: zf.extractall(dataset_dir) # load the dataset with waifuc source = LocalSource(dataset_dir) for item in source: print(item.image, item.meta['filename'], item.meta['tags']) ``` ## List of Clusters List of tag clustering result, maybe some outfits can be mined here. ### Raw Text Version | # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | Tags | |----:|----------:|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | 0 | 7 | ![](samples/0/clu0-sample0.png) | ![](samples/0/clu0-sample1.png) | ![](samples/0/clu0-sample2.png) | ![](samples/0/clu0-sample3.png) | ![](samples/0/clu0-sample4.png) | 1girl, gradient_hair, serafuku, solo, upper_body, short_sleeves, smile, open_mouth, red_bowtie, looking_at_viewer, nosebleed, white_sailor_collar, black_shirt, collarbone, locker, sweat | | 1 | 8 | ![](samples/1/clu1-sample0.png) | ![](samples/1/clu1-sample1.png) | ![](samples/1/clu1-sample2.png) | ![](samples/1/clu1-sample3.png) | ![](samples/1/clu1-sample4.png) | 1girl, solo, t-shirt, backpack, gradient_hair, large_breasts, short_sleeves, white_shirt, upper_body, open_mouth, sky, :d, cloud, opaque_glasses, two-tone_hair | | 2 | 7 | 
![](samples/2/clu2-sample0.png) | ![](samples/2/clu2-sample1.png) | ![](samples/2/clu2-sample2.png) | ![](samples/2/clu2-sample3.png) | ![](samples/2/clu2-sample4.png) | 1girl, collarbone, shirt, smile, solo, upper_body, looking_at_viewer, blush, large_breasts, open_mouth | | 3 | 6 | ![](samples/3/clu3-sample0.png) | ![](samples/3/clu3-sample1.png) | ![](samples/3/clu3-sample2.png) | ![](samples/3/clu3-sample3.png) | ![](samples/3/clu3-sample4.png) | 1girl, day, outdoors, railing, short_sleeves, solo, black_shirt, black_skirt, blue_sky, cloud, sailor_collar, serafuku, pleated_skirt, anime_coloring | | 4 | 22 | ![](samples/4/clu4-sample0.png) | ![](samples/4/clu4-sample1.png) | ![](samples/4/clu4-sample2.png) | ![](samples/4/clu4-sample3.png) | ![](samples/4/clu4-sample4.png) | 1girl, ponytail, solo, upper_body, blue_kimono, open_mouth, looking_at_viewer, indoors | | 5 | 6 | ![](samples/5/clu5-sample0.png) | ![](samples/5/clu5-sample1.png) | ![](samples/5/clu5-sample2.png) | ![](samples/5/clu5-sample3.png) | ![](samples/5/clu5-sample4.png) | black_hair, black_serafuku, black_shirt, black_skirt, outdoors, pleated_skirt, short_hair, short_sleeves, 3girls, arms_behind_back, blonde_hair, blue_shorts, brown_footwear, pantyhose, shoes, socks, white_sailor_collar, bound, denim_shorts, standing | | 6 | 7 | ![](samples/6/clu6-sample0.png) | ![](samples/6/clu6-sample1.png) | ![](samples/6/clu6-sample2.png) | ![](samples/6/clu6-sample3.png) | ![](samples/6/clu6-sample4.png) | cloud, day, ocean, outdoors, blue_bikini, polka_dot_bikini, 1girl, smile, solo, looking_at_viewer, medium_breasts, open_mouth, blue_sky, holding, navel, standing, wading, water_gun | ### Table Version | # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | 1girl | gradient_hair | serafuku | solo | upper_body | short_sleeves | smile | open_mouth | red_bowtie | looking_at_viewer | nosebleed | white_sailor_collar | black_shirt | collarbone | locker | sweat | t-shirt | backpack | large_breasts | white_shirt | 
sky | :d | cloud | opaque_glasses | two-tone_hair | shirt | blush | day | outdoors | railing | black_skirt | blue_sky | sailor_collar | pleated_skirt | anime_coloring | ponytail | blue_kimono | indoors | black_hair | black_serafuku | short_hair | 3girls | arms_behind_back | blonde_hair | blue_shorts | brown_footwear | pantyhose | shoes | socks | bound | denim_shorts | standing | ocean | blue_bikini | polka_dot_bikini | medium_breasts | holding | navel | wading | water_gun | |----:|----------:|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------------------------------|:--------|:----------------|:-----------|:-------|:-------------|:----------------|:--------|:-------------|:-------------|:--------------------|:------------|:----------------------|:--------------|:-------------|:---------|:--------|:----------|:-----------|:----------------|:--------------|:------|:-----|:--------|:-----------------|:----------------|:--------|:--------|:------|:-----------|:----------|:--------------|:-----------|:----------------|:----------------|:-----------------|:-----------|:--------------|:----------|:-------------|:-----------------|:-------------|:---------|:-------------------|:--------------|:--------------|:-----------------|:------------|:--------|:--------|:--------|:---------------|:-----------|:--------|:--------------|:-------------------|:-----------------|:----------|:--------|:---------|:------------| | 0 | 7 | ![](samples/0/clu0-sample0.png) | ![](samples/0/clu0-sample1.png) | ![](samples/0/clu0-sample2.png) | ![](samples/0/clu0-sample3.png) | ![](samples/0/clu0-sample4.png) | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 1 | 8 | ![](samples/1/clu1-sample0.png) | ![](samples/1/clu1-sample1.png) | ![](samples/1/clu1-sample2.png) | ![](samples/1/clu1-sample3.png) | 
![](samples/1/clu1-sample4.png) | X | X | | X | X | X | | X | | | | | | | | | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 2 | 7 | ![](samples/2/clu2-sample0.png) | ![](samples/2/clu2-sample1.png) | ![](samples/2/clu2-sample2.png) | ![](samples/2/clu2-sample3.png) | ![](samples/2/clu2-sample4.png) | X | | | X | X | | X | X | | X | | | | X | | | | | X | | | | | | | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 3 | 6 | ![](samples/3/clu3-sample0.png) | ![](samples/3/clu3-sample1.png) | ![](samples/3/clu3-sample2.png) | ![](samples/3/clu3-sample3.png) | ![](samples/3/clu3-sample4.png) | X | | X | X | | X | | | | | | | X | | | | | | | | | | X | | | | | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | 4 | 22 | ![](samples/4/clu4-sample0.png) | ![](samples/4/clu4-sample1.png) | ![](samples/4/clu4-sample2.png) | ![](samples/4/clu4-sample3.png) | ![](samples/4/clu4-sample4.png) | X | | | X | X | | | X | | X | | | | | | | | | | | | | | | | | | | | | | | | | | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | 5 | 6 | ![](samples/5/clu5-sample0.png) | ![](samples/5/clu5-sample1.png) | ![](samples/5/clu5-sample2.png) | ![](samples/5/clu5-sample3.png) | ![](samples/5/clu5-sample4.png) | | | | | | X | | | | | | X | X | | | | | | | | | | | | | | | | X | | X | | | X | | | | | X | X | X | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | 6 | 7 | ![](samples/6/clu6-sample0.png) | ![](samples/6/clu6-sample1.png) | ![](samples/6/clu6-sample2.png) | ![](samples/6/clu6-sample3.png) | ![](samples/6/clu6-sample4.png) | X | | | X | | | X | X | | X | | | | | | | | | | | | | X | | | | | X | X | | | X | | | | | | | | | | | | | | | | | | | | X | X | X | X | X | X | X | X | X |
Provider:
CyberHarem
Summary of original information

Dataset overview

Dataset information

  • Name: Dataset of Rina Shioi/潮井梨ナ (Mahou Shoujo Site)
  • Description: A dataset of 245 images and their tags.
  • Core tags: pink_hair, glasses, twintails, scrunchie, long_hair, pink_eyes, hair_scrunchie, hair_ornament, low_twintails, black-framed_eyewear, breasts
  • Source: Crawled from multiple sites (e.g. danbooru, pixiv, zerochan).
  • License: MIT
  • Task category: text-to-image
  • Tags: art, not-for-all-audiences
  • Size category: n<1K
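
The dataset card describes the core tags as pruned from the dataset. A minimal sketch of what such pruning could look like for one comma-separated tag string is shown here; `CORE_TAGS` is copied from the list above, while `prune_core_tags` is a hypothetical helper and not part of waifuc or the dataset's own tooling.

```python
# Core tags as listed for this character (copied from the dataset card).
CORE_TAGS = {
    "pink_hair", "glasses", "twintails", "scrunchie", "long_hair",
    "pink_eyes", "hair_scrunchie", "hair_ornament", "low_twintails",
    "black-framed_eyewear", "breasts",
}


def prune_core_tags(caption, core_tags=CORE_TAGS):
    """Drop core tags from a comma-separated tag string (hypothetical helper)."""
    tags = [tag.strip() for tag in caption.split(",")]
    # Keep non-empty tags that are not core tags, preserving their order.
    kept = [tag for tag in tags if tag and tag not in core_tags]
    return ", ".join(kept)


if __name__ == "__main__":
    print(prune_core_tags("1girl, pink_hair, glasses, solo, smile"))
```

Pruning of this kind leaves only the tags that vary between images, which is presumably why the cluster tags listed for this dataset describe outfits and scenes rather than hair or eye colour.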

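List of dataset packages

| Name | Images | Size | Type | Description |
|:-----------------|---------:|:-----------|:-----------|:------------|
| raw | 245 | 152.72 MiB | Waifuc-Raw | Raw data with meta information (min edge aligned to 1400 pixels if larger). |
| 1200 | 245 | 152.62 MiB | IMG+TXT | Dataset with the shorter side not exceeding 1200 pixels. |
| stage3-p480-1200 | 496 | 276.59 MiB | IMG+TXT | 3-stage cropped dataset with the area not less than 480x480 pixels. |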
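
For the IMG+TXT packages, a reader might pair each image with its tag file after extracting the archive. The sketch below assumes, without confirmation from the dataset card, that every image has a sibling `.txt` file with the same stem holding comma-separated tags; `pair_images_with_tags` and `IMAGE_SUFFIXES` are illustrative names, not part of any library.

```python
from pathlib import Path

# Assumed image extensions; the archive contents are not documented here.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def pair_images_with_tags(root):
    """Return (image_path, tags) pairs found under an extracted IMG+TXT directory."""
    pairs = []
    for image_path in sorted(Path(root).rglob("*")):
        if image_path.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        # Assumption: the tag file shares the image's stem, e.g. 0001.png / 0001.txt.
        txt_path = image_path.with_suffix(".txt")
        if not txt_path.is_file():
            continue  # skip images without a tag file
        text = txt_path.read_text(encoding="utf-8")
        tags = [tag.strip() for tag in text.split(",") if tag.strip()]
        pairs.append((image_path, tags))
    return pairs
```

Images without a matching tag file are skipped rather than raising an error, so the function can also be pointed at a directory that mixes tagged and untagged files.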

Loading the dataset

  • Loading tool: waifuc

  • Example code (Python):

    import os
    import zipfile

    from huggingface_hub import hf_hub_download
    from waifuc.source import LocalSource

    # Download the raw archive file
    zip_file = hf_hub_download(
        repo_id='CyberHarem/rina_shioi_mahoushoujosite',
        repo_type='dataset',
        filename='dataset-raw.zip',
    )

    # Extract the files to the target directory
    dataset_dir = 'dataset_dir'
    os.makedirs(dataset_dir, exist_ok=True)
    with zipfile.ZipFile(zip_file, 'r') as zf:
        zf.extractall(dataset_dir)

    # Load the dataset with waifuc
    source = LocalSource(dataset_dir)
    for item in source:
        print(item.image, item.meta['filename'], item.meta['tags'])
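
Because the layout inside `dataset-raw.zip` is not described here, it can help to inspect the archive before extracting it. The following sketch uses only the Python standard library to count archive members by file extension; `summarize_archive` is a hypothetical helper name.

```python
import zipfile
from collections import Counter
from pathlib import PurePosixPath


def summarize_archive(zip_path):
    """Count the files in a zip archive by lowercase extension."""
    counts = Counter()
    with zipfile.ZipFile(zip_path, "r") as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue  # directory entries carry no data
            suffix = PurePosixPath(info.filename).suffix.lower()
            counts[suffix or "(no extension)"] += 1
    return counts
```

Calling `summarize_archive(zip_file)` on the path returned by `hf_hub_download` gives a quick check that the number of image files is in line with the 245 images stated for the raw package.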

Tag clustering results

Raw text version

| # | Samples | Tags |
|----:|----------:|:-----|
| 0 | 7 | 1girl, gradient_hair, serafuku, solo, upper_body, short_sleeves, smile, open_mouth, red_bowtie, looking_at_viewer, nosebleed, white_sailor_collar, black_shirt, collarbone, locker, sweat |
| 1 | 8 | 1girl, solo, t-shirt, backpack, gradient_hair, large_breasts, short_sleeves, white_shirt, upper_body, open_mouth, sky, :d, cloud, opaque_glasses, two-tone_hair |
| 2 | 7 | 1girl, collarbone, shirt, smile, solo, upper_body, looking_at_viewer, blush, large_breasts, open_mouth |
| 3 | 6 | 1girl, day, outdoors, railing, short_sleeves, solo, black_shirt, black_skirt, blue_sky, cloud, sailor_collar, serafuku, pleated_skirt, anime_coloring |
| 4 | 22 | 1girl, ponytail, solo, upper_body, blue_kimono, open_mouth, looking_at_viewer, indoors |
| 5 | 6 | black_hair, black_serafuku, black_shirt, black_skirt, outdoors, pleated_skirt, short_hair, short_sleeves, 3girls, arms_behind_back, blonde_hair, blue_shorts, brown_footwear, pantyhose, shoes, socks, white_sailor_collar, bound, denim_shorts, standing |
| 6 | 7 | cloud, day, ocean, outdoors, blue_bikini, polka_dot_bikini, 1girl, smile, solo, looking_at_viewer, medium_breasts, open_mouth, blue_sky, holding, navel, standing, wading, water_gun |
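
The table version of these clusters appears to be a presence matrix derived from the raw text version: one column per tag, ordered by first appearance across clusters, with `X` marking the clusters that list the tag. The sketch below rebuilds such a matrix under that assumption; `build_presence_table` is a hypothetical helper, not part of the dataset tooling.

```python
def build_presence_table(cluster_tag_strings):
    """Build (columns, rows) for a tag presence matrix.

    cluster_tag_strings: one comma-separated tag string per cluster.
    columns: unique tags in order of first appearance.
    rows: per cluster, "X" where the cluster has the tag, else "".
    """
    clusters = [
        [tag.strip() for tag in text.split(",") if tag.strip()]
        for text in cluster_tag_strings
    ]
    columns = []
    seen = set()
    for tags in clusters:
        for tag in tags:
            if tag not in seen:
                seen.add(tag)
                columns.append(tag)
    rows = []
    for tags in clusters:
        present = set(tags)
        rows.append(["X" if column in present else "" for column in columns])
    return columns, rows


if __name__ == "__main__":
    # Clusters 2 and 4 from the raw text version.
    columns, rows = build_presence_table([
        "1girl, collarbone, shirt, smile, solo, upper_body, "
        "looking_at_viewer, blush, large_breasts, open_mouth",
        "1girl, ponytail, solo, upper_body, blue_kimono, open_mouth, "
        "looking_at_viewer, indoors",
    ])
    print(columns)
    for row in rows:
        print(row)
```

Ordering columns by first appearance keeps each cluster's new tags grouped together, which is consistent with the column order seen in this dataset's table version.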

Table version

# Samples Img-1 Img-2 Img-3 Img-4 Img-5 1girl gradient_hair serafuku solo upper_body short_sleeves smile open_mouth red_bowtie looking_at_viewer nosebleed white_sailor_collar black_shirt collarbone locker sweat t-shirt backpack large_breasts white_shirt sky :d cloud opaque_glasses two-tone_hair shirt blush day outdoors railing black_skirt blue_sky sailor_collar pleated_skirt anime_coloring ponytail blue_kimono indoors black_hair black_serafuku short_hair 3girls arms_behind_back blonde_hair blue_shorts brown_footwear pantyhose shoes socks bound denim_shorts standing ocean blue_bikini polka_dot_bikini medium_breasts holding navel wading water_gun
0 7 X X X X X X X X X X X X X X X