CyberHarem/yoimiya_genshin
收藏Hugging Face2024-05-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/CyberHarem/yoimiya_genshin
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- text-to-image
tags:
- art
- not-for-all-audiences
size_categories:
- n<1K
---
# Dataset of yoimiya/宵宮/宵宫 (Genshin Impact)
This is the dataset of yoimiya/宵宮/宵宫 (Genshin Impact), containing 500 images and their tags.
The core tags of this character are `blonde_hair, ponytail, breasts, hair_ornament, yellow_eyes, medium_breasts, hair_between_eyes, orange_eyes`, which are pruned in this dataset.
Images are crawled from many sites (e.g. danbooru, pixiv, zerochan ...), the auto-crawling system is powered by [DeepGHS Team](https://github.com/deepghs)([huggingface organization](https://huggingface.co/deepghs)).
## List of Packages
| Name | Images | Size | Download | Type | Description | Images-others | Images-head |
|:-----------------|---------:|:---------|:-----------------------------------------------------------------------------------------------------------------|:-----------|:---------------------------------------------------------------------|:----------------|:--------------|
| raw | 500 | 1.27 GiB | [Download](https://huggingface.co/datasets/CyberHarem/yoimiya_genshin/resolve/main/dataset-raw.zip) | Waifuc-Raw | Raw data with meta information (min edge aligned to 1400 if larger). | -- | -- |
| stage3-p480-1200 | 1380 | 2.08 GiB | [Download](https://huggingface.co/datasets/CyberHarem/yoimiya_genshin/resolve/main/dataset-stage3-p480-1200.zip) | IMG+TXT | 3-stage cropped dataset with the area not less than 480x480 pixels. | 877 | 503 |
### Load Raw Dataset with Waifuc
We provide raw dataset (including tagged images) for [waifuc](https://deepghs.github.io/waifuc/main/tutorials/installation/index.html) loading. If you need this, just run the following code
```python
import os
import zipfile
from huggingface_hub import hf_hub_download
from waifuc.source import LocalSource
# download raw archive file
zip_file = hf_hub_download(
repo_id='CyberHarem/yoimiya_genshin',
repo_type='dataset',
filename='dataset-raw.zip',
)
# extract files to your directory
dataset_dir = 'dataset_dir'
os.makedirs(dataset_dir, exist_ok=True)
with zipfile.ZipFile(zip_file, 'r') as zf:
zf.extractall(dataset_dir)
# load the dataset with waifuc
source = LocalSource(dataset_dir)
for item in source:
print(item.image, item.meta['filename'], item.meta['tags'])
```
## List of Clusters
List of tag clustering result, maybe some outfits can be mined here.
### Raw Text Version
| # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | Tags |
|----:|----------:|:--------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 0 | 16 |  |  |  |  |  | 1girl, bandaged_arm, chest_tattoo, looking_at_viewer, night_sky, obi, orange_kimono, red_choker, solo, arm_tattoo, fingerless_gloves, fireworks, shimenawa, bandaged_leg, hadanugi_dousa, chest_sarashi, cleavage, vision_(genshin_impact), black_gloves, pouch, open_mouth, :d, blush, candy_apple, holding, jewelry |
| 1 | 5 |  |  |  |  |  | 1girl, arm_tattoo, bandaged_arm, bandaged_leg, bare_shoulders, chest_sarashi, fireworks, hadanugi_dousa, looking_at_viewer, night_sky, orange_kimono, red_choker, solo, thighs, chest_tattoo, cleavage, shimenawa, sitting, blush, obi, outdoors, smile, socks |
| 2 | 5 |  |  |  |  |  | 1girl, arm_tattoo, bandaged_leg, chest_tattoo, hadanugi_dousa, looking_at_viewer, obi, orange_kimono, red_choker, solo, bandaged_arm, chest_sarashi, cleavage, shimenawa, vision_(genshin_impact), cowboy_shot, grin, fireworks, leaf, night, pouch, standing |
| 3 | 22 |  |  |  |  |  | 1girl, arm_tattoo, bandaged_arm, hadanugi_dousa, shimenawa, solo, chest_tattoo, holding_bow_(weapon), orange_kimono, looking_at_viewer, smile, red_choker, chest_sarashi, bandaged_leg, cleavage, fingerless_gloves, obi, open_mouth, black_gloves, simple_background |
| 4 | 5 |  |  |  |  |  | 1girl, arm_tattoo, blush, chest_sarashi, chest_tattoo, closed_mouth, hadanugi_dousa, looking_at_viewer, orange_kimono, red_choker, shimenawa, smile, solo, upper_body, bandaged_arm, cleavage, collarbone, fingerless_gloves, obi, single_bare_shoulder, vision_(genshin_impact), sparkler, bare_shoulders, black_gloves, bracelet, large_breasts |
| 5 | 8 |  |  |  |  |  | 1girl, alternate_costume, long_sleeves, looking_at_viewer, midriff, navel, solo, stomach, cleavage, open_jacket, short_shorts, smile, belt, cowboy_shot, red_choker, thighs, bare_shoulders, collarbone, crop_top, off_shoulder, bandeau, chest_tattoo, large_breasts, orange_jacket, simple_background, thigh_strap, white_background, black_shorts, blue_shorts, denim, sarashi, shirt, standing, tube_top |
| 6 | 11 |  |  |  |  |  | 1girl, alternate_costume, looking_at_viewer, pleated_skirt, solo, white_shirt, sailor_collar, serafuku, smile, choker, red_neckerchief, short_sleeves, blue_skirt, blush, contemporary, outdoors, sitting, black_skirt, holding, long_hair, long_sleeves, midriff, miniskirt, navel, open_mouth, shoes, thighs |
### Table Version
| # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | 1girl | bandaged_arm | chest_tattoo | looking_at_viewer | night_sky | obi | orange_kimono | red_choker | solo | arm_tattoo | fingerless_gloves | fireworks | shimenawa | bandaged_leg | hadanugi_dousa | chest_sarashi | cleavage | vision_(genshin_impact) | black_gloves | pouch | open_mouth | :d | blush | candy_apple | holding | jewelry | bare_shoulders | thighs | sitting | outdoors | smile | socks | cowboy_shot | grin | leaf | night | standing | holding_bow_(weapon) | simple_background | closed_mouth | upper_body | collarbone | single_bare_shoulder | sparkler | bracelet | large_breasts | alternate_costume | long_sleeves | midriff | navel | stomach | open_jacket | short_shorts | belt | crop_top | off_shoulder | bandeau | orange_jacket | thigh_strap | white_background | black_shorts | blue_shorts | denim | sarashi | shirt | tube_top | pleated_skirt | white_shirt | sailor_collar | serafuku | choker | red_neckerchief | short_sleeves | blue_skirt | contemporary | black_skirt | long_hair | miniskirt | shoes |
|----:|----------:|:--------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------|:--------|:---------------|:---------------|:--------------------|:------------|:------|:----------------|:-------------|:-------|:-------------|:--------------------|:------------|:------------|:---------------|:-----------------|:----------------|:-----------|:--------------------------|:---------------|:--------|:-------------|:-----|:--------|:--------------|:----------|:----------|:-----------------|:---------|:----------|:-----------|:--------|:--------|:--------------|:-------|:-------|:--------|:-----------|:-----------------------|:--------------------|:---------------|:-------------|:-------------|:-----------------------|:-----------|:-----------|:----------------|:--------------------|:---------------|:----------|:--------|:----------|:--------------|:---------------|:-------|:-----------|:---------------|:----------|:----------------|:--------------|:-------------------|:---------------|:--------------|:--------|:----------|:--------|:-----------|:----------------|:--------------|:----------------|:-----------|:---------|:------------------|:----------------|:-------------|:---------------|:--------------|:------------|:------------|:--------|
| 0 | 16 |  |  |  |  |  | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 1 | 5 |  |  |  |  |  | X | X | X | X | X | X | X | X | X | X | | X | X | X | X | X | X | | | | | | X | | | | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 2 | 5 |  |  |  |  |  | X | X | X | X | | X | X | X | X | X | | X | X | X | X | X | X | X | | X | | | | | | | | | | | | | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 3 | 22 |  |  |  |  |  | X | X | X | X | | X | X | X | X | X | X | | X | X | X | X | X | | X | | X | | | | | | | | | | X | | | | | | | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 4 | 5 |  |  |  |  |  | X | X | X | X | | X | X | X | X | X | X | | X | | X | X | X | X | X | | | | X | | | | X | | | | X | | | | | | | | | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 5 | 8 |  |  |  |  |  | X | | X | X | | | | X | X | | | | | | | | X | | | | | | | | | | X | X | | | X | | X | | | | X | | X | | | X | | | | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | |
| 6 | 11 |  |  |  |  |  | X | | | X | | | | | X | | | | | | | | | | | | X | | X | | X | | | X | X | X | X | | | | | | | | | | | | | | | | X | X | X | X | | | | | | | | | | | | | | | | | X | X | X | X | X | X | X | X | X | X | X | X | X |
提供机构:
CyberHarem
原始信息汇总
数据集概述
基本信息
- 名称: yoimiya/宵宮/宵宫 (Genshin Impact)
- 许可证: MIT
- 任务类别: text-to-image
- 标签: art, not-for-all-audiences
- 大小类别: n<1K
数据集内容
- 描述: 包含500张图像及其标签,主要角色标签包括
blonde_hair, ponytail, breasts, hair_ornament, yellow_eyes, medium_breasts, hair_between_eyes, orange_eyes。 - 来源: 图像从多个网站爬取,如danbooru, pixiv, zerochan等,由DeepGHS Team提供自动爬虫系统。
数据集包
| 名称 | 图像数量 | 大小 | 下载链接 | 类型 | 描述 |
|---|---|---|---|---|---|
| raw | 500 | 1.27 GiB | 下载 | Waifuc-Raw | 包含元信息的原始数据(最小边对齐至1400像素,如果更大)。 |
| stage3-p480-1200 | 1380 | 2.08 GiB | 下载 | IMG+TXT | 三阶段裁剪数据集,区域不小于480x480像素。 |
数据集使用
- 加载: 提供使用waifuc加载原始数据集的示例代码,包括下载和解压数据集文件,以及使用LocalSource加载数据集。
集群列表
- 集群结果: 列出了标签聚类结果,可能包含可挖掘的服装信息。每个集群包含多个样本,每个样本包含多张图像及其详细标签。
集群示例
| # | 样本数 | 图像标签(部分) |
|---|---|---|
| 0 | 16 | 1girl, bandaged_arm, chest_tattoo, looking_at_viewer, night_sky, obi, orange_kimono, red_choker, solo |
| 1 | 5 | 1girl, arm_tattoo, bandaged_arm, bandaged_leg, bare_shoulders, chest_sarashi, fireworks, hadanugi_dousa |
| 2 | 5 | 1girl, arm_tattoo, bandaged_leg, chest_tattoo, hadanugi_dousa, looking_at_viewer, obi, orange_kimono |
| 3 | 22 | 1girl, arm_tattoo, bandaged_arm, hadanugi_dousa, shimenawa, solo, chest_tattoo, holding_bow_(weapon) |
| 4 | 5 | 1girl, arm_tattoo, blush, chest_sarashi, chest_tattoo, closed_mouth, hadanugi_dousa, looking_at_viewer |
| 5 | 8 | 1girl, alternate_costume, long_sleeves, looking_at_viewer, midriff, navel, solo, stomach |
| 6 | 11 | 1girl, alternate_costume, looking_at_viewer, pleated_skirt, solo, white_shirt, sailor_collar, serafuku |
搜集汇总
数据集介绍

构建方式
在动漫角色图像数据集的构建领域,CyberHarem/yoimiya_genshin数据集通过自动化爬虫系统从多个知名艺术平台(如Danbooru、Pixiv、Zerochan等)系统性地采集原始图像。采集过程中,系统对图像进行了标准化预处理,确保最小边缘对齐至1400像素以维持视觉一致性,并剔除了角色的核心标签以优化数据纯度。原始数据包包含500张带有元信息(如文件名和标签)的图像,为后续处理提供了高质量的基础素材。
特点
该数据集专注于《原神》角色宵宮(Yoimiya)的视觉呈现,其核心特征体现在多维度标注体系与结构化数据组织上。数据集不仅提供原始图像与对应标签,还包含经过三阶段裁剪处理的版本,确保每张图像区域不低于480×480像素,从而适配不同分辨率的模型训练需求。此外,数据集通过聚类分析展示了角色在不同服饰、姿态与场景下的视觉模式,例如传统橘色和服与现代交替服饰的细分类别,为风格化生成任务提供了细粒度的语义参照。
使用方法
针对文本到图像生成任务,研究者可通过下载原始压缩包并利用Waifuc工具加载本地数据源,直接访问图像及其元标签信息。数据集支持两种应用模式:原始数据包适用于需要完整元信息的自定义处理流程,而预处理后的裁剪版本则便于直接投入模型训练。用户可依据聚类结果筛选特定视觉主题的图像子集,例如专注于传统服饰或特定姿态的样本,从而针对性地优化生成模型的风格一致性与细节还原能力。
背景与挑战
背景概述
在数字艺术与生成式人工智能蓬勃发展的时代,针对特定主题的高质量图像数据集成为推动文本到图像模型精细化生成能力的关键资源。CyberHarem/yoimiya_genshin数据集由DeepGHS团队构建,专注于汇聚热门游戏《原神》中角色“宵宫”的二次创作视觉资料。该数据集核心旨在解决角色一致性图像生成中的训练数据稀缺问题,通过系统化爬取与标注,为社区提供了包含500张原始图像及其丰富标签的结构化资源。其创建不仅服务于风格化角色图像的生成研究,亦为动漫艺术领域的计算机视觉应用提供了宝贵的基准数据。
当前挑战
该数据集所应对的领域挑战在于,如何从海量且风格各异的二次创作中,精准构建能够表征特定动漫角色视觉一致性的高质量训练集。这涉及对角色核心特征(如发色、服饰、装饰)的稳定识别与标注,以支撑模型学习并生成符合预期的图像。在构建过程中,挑战主要体现在多源数据爬取与清洗的复杂性上,包括从Danbooru、Pixiv等平台获取图像时面临的格式不一、标签噪声以及版权与内容适宜性审查问题。此外,为确保数据质量,需对图像进行尺寸标准化与内容裁剪,并剔除不相关或低质量的样本,这一过程对自动化处理系统的鲁棒性提出了较高要求。
常用场景
经典使用场景
在动漫风格图像生成领域,该数据集以其精细标注的角色特征图像,为文本到图像生成模型提供了高质量的微调素材。通过整合来自多个知名平台的图像资源,数据集聚焦于《原神》角色宵宫,涵盖了多样化的姿态、服饰与场景,使得研究者能够训练模型精准捕捉特定角色的视觉特征,从而生成风格一致且细节丰富的动漫图像。
衍生相关工作
围绕该数据集,衍生了一系列专注于动漫角色生成的经典研究工作,例如基于稳定扩散模型的角色定制化微调方法、结合标签聚类技术的图像风格迁移算法,以及利用多源数据融合提升生成多样性的训练框架。这些工作进一步拓展了数据集的学术价值,推动了动漫图像生成领域的模型优化与应用创新。
数据集最近研究
最新研究方向
在数字艺术与生成式人工智能的交叉领域,角色专属图像数据集正成为推动风格化内容生成研究的关键资源。以《原神》角色宵宫为主题的数据集,凭借其精细标注的视觉特征与多源采集的样本,为文本到图像生成模型提供了高质量的微调基础。当前研究聚焦于利用此类数据集探索角色一致性保持、多姿态与服饰的泛化生成,以及跨文化动漫风格的适应性学习。随着社区对个性化数字内容需求的增长,这类数据集在促进生成模型的艺术表现力与可控性方面展现出显著潜力,为游戏角色衍生创作与虚拟形象设计提供了技术支撑。
以上内容由遇见数据集搜集并总结生成



