CyberHarem/toujou_nozomi_lovelive
收藏Hugging Face2024-01-17 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/CyberHarem/toujou_nozomi_lovelive
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- text-to-image
tags:
- art
- not-for-all-audiences
size_categories:
- n<1K
---
# Dataset of toujou_nozomi/東條希 (Love Live!)
This is the dataset of toujou_nozomi/東條希 (Love Live!), containing 500 images and their tags.
The core tags of this character are `purple_hair, long_hair, green_eyes, breasts, twintails, large_breasts, low_twintails, hair_ornament, bangs`, which are pruned in this dataset.
Images are crawled from many sites (e.g. danbooru, pixiv, zerochan ...), the auto-crawling system is powered by [DeepGHS Team](https://github.com/deepghs)([huggingface organization](https://huggingface.co/deepghs)).
## List of Packages
| Name | Images | Size | Download | Type | Description |
|:-----------------|---------:|:-----------|:------------------------------------------------------------------------------------------------------------------------|:-----------|:---------------------------------------------------------------------|
| raw | 500 | 708.81 MiB | [Download](https://huggingface.co/datasets/CyberHarem/toujou_nozomi_lovelive/resolve/main/dataset-raw.zip) | Waifuc-Raw | Raw data with meta information (min edge aligned to 1400 if larger). |
| 800 | 500 | 400.67 MiB | [Download](https://huggingface.co/datasets/CyberHarem/toujou_nozomi_lovelive/resolve/main/dataset-800.zip) | IMG+TXT | dataset with the shorter side not exceeding 800 pixels. |
| stage3-p480-800 | 1208 | 840.45 MiB | [Download](https://huggingface.co/datasets/CyberHarem/toujou_nozomi_lovelive/resolve/main/dataset-stage3-p480-800.zip) | IMG+TXT | 3-stage cropped dataset with the area not less than 480x480 pixels. |
| 1200 | 500 | 622.35 MiB | [Download](https://huggingface.co/datasets/CyberHarem/toujou_nozomi_lovelive/resolve/main/dataset-1200.zip) | IMG+TXT | dataset with the shorter side not exceeding 1200 pixels. |
| stage3-p480-1200 | 1208 | 1.16 GiB | [Download](https://huggingface.co/datasets/CyberHarem/toujou_nozomi_lovelive/resolve/main/dataset-stage3-p480-1200.zip) | IMG+TXT | 3-stage cropped dataset with the area not less than 480x480 pixels. |
### Load Raw Dataset with Waifuc
We provide raw dataset (including tagged images) for [waifuc](https://deepghs.github.io/waifuc/main/tutorials/installation/index.html) loading. If you need this, just run the following code
```python
import os
import zipfile
from huggingface_hub import hf_hub_download
from waifuc.source import LocalSource
# download raw archive file
zip_file = hf_hub_download(
repo_id='CyberHarem/toujou_nozomi_lovelive',
repo_type='dataset',
filename='dataset-raw.zip',
)
# extract files to your directory
dataset_dir = 'dataset_dir'
os.makedirs(dataset_dir, exist_ok=True)
with zipfile.ZipFile(zip_file, 'r') as zf:
zf.extractall(dataset_dir)
# load the dataset with waifuc
source = LocalSource(dataset_dir)
for item in source:
print(item.image, item.meta['filename'], item.meta['tags'])
```
## List of Clusters
List of tag clustering result, maybe some outfits can be mined here.
### Raw Text Version
| # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | Tags |
|----:|----------:|:----------------------------------|:----------------------------------|:----------------------------------|:----------------------------------|:----------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 0 | 16 |  |  |  |  |  | 1girl, smile, solo, looking_at_viewer, white_dress, blush, flower, wedding_dress, bouquet, cleavage, elbow_gloves, jewelry, bare_shoulders, bridal_veil, garter_straps, open_mouth, thighhighs, tiara, white_gloves |
| 1 | 11 |  |  |  |  |  | 1girl, looking_at_viewer, smile, solo, blush, skirt, hair_flower, navel, twin_braids, aqua_eyes, black_thighhighs, card, cleavage, dated, earrings, frills, holding, ribbon, very_long_hair |
| 2 | 14 |  |  |  |  |  | 1girl, blue_skirt, blush, looking_at_viewer, otonokizaka_school_uniform, pleated_skirt, solo, white_shirt, plaid_skirt, collared_shirt, hair_scrunchie, green_bowtie, striped_bowtie, pink_scrunchie, simple_background, smile, short_sleeves, summer_uniform, white_background, miniskirt, sweater_vest, black_thighhighs, long_sleeves, winter_uniform, zettai_ryouiki |
| 3 | 8 |  |  |  |  |  | 1girl, blazer, otonokizaka_school_uniform, solo, striped_bowtie, upper_body, winter_uniform, smile, blush, looking_at_viewer, pink_scrunchie, green_bowtie, long_sleeves, blue_jacket, collared_shirt, white_shirt |
| 4 | 40 |  |  |  |  |  | 1girl, solo, hair_flower, looking_at_viewer, crown, dress, smile, single_braid, vines, bare_shoulders, hair_over_shoulder, very_long_hair, thighhighs |
| 5 | 6 |  |  |  |  |  | 1girl, braid, solo, blush, looking_at_viewer, smile, hair_over_shoulder, mini_top_hat, open_mouth, skirt |
| 6 | 7 |  |  |  |  |  | 1girl, blush, looking_at_viewer, smile, solo, frills, white_gloves, aqua_eyes, earrings, parted_bangs, choker, idol, purple_dress, scrunchie, sparkle, bow, collarbone, maid_headdress, skirt, stage, star_(symbol) |
| 7 | 6 |  |  |  |  |  | 1girl, cleavage, solo, heart, looking_at_viewer, maid_headdress, blush, smile, thighhighs, very_long_hair |
| 8 | 12 |  |  |  |  |  | 1girl, solo, cloud, day, looking_at_viewer, ocean, outdoors, navel, smile, beach, blush, cleavage, frilled_bikini, hair_flower, open_mouth, blue_sky, bracelet, collarbone |
| 9 | 7 |  |  |  |  |  | 1girl, hakama_skirt, looking_at_viewer, miko, red_hakama, solo, smile, blush, wide_sleeves, outdoors, aqua_eyes, holding_broom, kimono, open_mouth, ribbon, shrine |
| 10 | 5 |  |  |  |  |  | 1girl, elbow_gloves, witch_hat, black_gloves, black_thighhighs, solo, high_heels, looking_at_viewer, smile, star_hair_ornament, aqua_eyes, capelet, card, cleavage, dress, halloween, ribbon, skirt |
| 11 | 6 |  |  |  |  |  | 1girl, kimono, looking_at_viewer, smile, solo, aqua_eyes, floral_print, obi, blush, wide_sleeves, alternate_hairstyle, hair_bow, hair_flower |
| 12 | 5 |  |  |  |  |  | 1girl, blush, navel, solo, cleavage, looking_at_viewer, underwear_only, very_long_hair, pink_bra, pink_panties, smile, thigh_gap, white_bra |
### Table Version
| # | Samples | Img-1 | Img-2 | Img-3 | Img-4 | Img-5 | 1girl | smile | solo | looking_at_viewer | white_dress | blush | flower | wedding_dress | bouquet | cleavage | elbow_gloves | jewelry | bare_shoulders | bridal_veil | garter_straps | open_mouth | thighhighs | tiara | white_gloves | skirt | hair_flower | navel | twin_braids | aqua_eyes | black_thighhighs | card | dated | earrings | frills | holding | ribbon | very_long_hair | blue_skirt | otonokizaka_school_uniform | pleated_skirt | white_shirt | plaid_skirt | collared_shirt | hair_scrunchie | green_bowtie | striped_bowtie | pink_scrunchie | simple_background | short_sleeves | summer_uniform | white_background | miniskirt | sweater_vest | long_sleeves | winter_uniform | zettai_ryouiki | blazer | upper_body | blue_jacket | crown | dress | single_braid | vines | hair_over_shoulder | braid | mini_top_hat | parted_bangs | choker | idol | purple_dress | scrunchie | sparkle | bow | collarbone | maid_headdress | stage | star_(symbol) | heart | cloud | day | ocean | outdoors | beach | frilled_bikini | blue_sky | bracelet | hakama_skirt | miko | red_hakama | wide_sleeves | holding_broom | kimono | shrine | witch_hat | black_gloves | high_heels | star_hair_ornament | capelet | halloween | floral_print | obi | alternate_hairstyle | hair_bow | underwear_only | pink_bra | pink_panties | thigh_gap | white_bra |
|----:|----------:|:----------------------------------|:----------------------------------|:----------------------------------|:----------------------------------|:----------------------------------|:--------|:--------|:-------|:--------------------|:--------------|:--------|:---------|:----------------|:----------|:-----------|:---------------|:----------|:-----------------|:--------------|:----------------|:-------------|:-------------|:--------|:---------------|:--------|:--------------|:--------|:--------------|:------------|:-------------------|:-------|:--------|:-----------|:---------|:----------|:---------|:-----------------|:-------------|:-----------------------------|:----------------|:--------------|:--------------|:-----------------|:-----------------|:---------------|:-----------------|:-----------------|:--------------------|:----------------|:-----------------|:-------------------|:------------|:---------------|:---------------|:-----------------|:-----------------|:---------|:-------------|:--------------|:--------|:--------|:---------------|:--------|:---------------------|:--------|:---------------|:---------------|:---------|:-------|:---------------|:------------|:----------|:------|:-------------|:-----------------|:--------|:----------------|:--------|:--------|:------|:--------|:-----------|:--------|:-----------------|:-----------|:-----------|:---------------|:-------|:-------------|:---------------|:----------------|:---------|:---------|:------------|:---------------|:-------------|:---------------------|:----------|:------------|:---------------|:------|:----------------------|:-----------|:-----------------|:-----------|:---------------|:------------|:------------|
| 0 | 16 |  |  |  |  |  | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 1 | 11 |  |  |  |  |  | X | X | X | X | | X | | | | X | | | | | | | | | | X | X | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 2 | 14 |  |  |  |  |  | X | X | X | X | | X | | | | | | | | | | | | | | | | | | | X | | | | | | | | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 3 | 8 |  |  |  |  |  | X | X | X | X | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | | X | | X | | X | X | X | | | | | | | X | X | | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 4 | 40 |  |  |  |  |  | X | X | X | X | | | | | | | | | X | | | | X | | | | X | | | | | | | | | | | X | | | | | | | | | | | | | | | | | | | | | | | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 5 | 6 |  |  |  |  |  | X | X | X | X | | X | | | | | | | | | | X | | | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 6 | 7 |  |  |  |  |  | X | X | X | X | | X | | | | | | | | | | | | | X | X | | | | X | | | | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | X | X | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 7 | 6 |  |  |  |  |  | X | X | X | X | | X | | | | X | | | | | | | X | | | | | | | | | | | | | | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
| 8 | 12 |  |  |  |  |  | X | X | X | X | | X | | | | X | | | | | | X | | | | | X | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | | | | | X | X | X | X | X | X | X | X | | | | | | | | | | | | | | | | | | | | | | |
| 9 | 7 |  |  |  |  |  | X | X | X | X | | X | | | | | | | | | | X | | | | | | | | X | | | | | | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | | | | | X | X | X | X | X | X | X | | | | | | | | | | | | | | | |
| 10 | 5 |  |  |  |  |  | X | X | X | X | | | | | | X | X | | | | | | | | | X | | | | X | X | X | | | | | X | | | | | | | | | | | | | | | | | | | | | | | | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | X | X | X | X | X | | | | | | | | | |
| 11 | 6 |  |  |  |  |  | X | X | X | X | | X | | | | | | | | | | | | | | | X | | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | | X | | | | | | | | X | X | X | X | | | | | |
| 12 | 5 |  |  |  |  |  | X | X | X | X | | X | | | | X | | | | | | | | | | | | X | | | | | | | | | | X | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | X | X | X | X | X |
提供机构:
CyberHarem
原始信息汇总
数据集概述
数据集信息
- 名称: Dataset of toujou_nozomi/東條希 (Love Live!)
- 描述: 包含500张图像及其标签,主题为Love Live!中的角色東條希。
- 核心标签:
purple_hair, long_hair, green_eyes, breasts, twintails, large_breasts, low_twintails, hair_ornament, bangs - 来源: 从多个网站(如danbooru, pixiv, zerochan等)爬取。
- 许可: MIT
- 任务类别: text-to-image
- 标签: art, not-for-all-audiences
- 大小类别: n<1K
数据集包列表
| 名称 | 图像数量 | 大小 | 类型 | 描述 |
|---|---|---|---|---|
| raw | 500 | 708.81 MiB | Waifuc-Raw | 包含元信息的原始数据(最小边对齐到1400像素,如果更大)。 |
| 800 | 500 | 400.67 MiB | IMG+TXT | 短边不超过800像素的数据集。 |
| stage3-p480-800 | 1208 | 840.45 MiB | IMG+TXT | 3阶段裁剪数据集,区域不小于480x480像素。 |
| 1200 | 500 | 622.35 MiB | IMG+TXT | 短边不超过1200像素的数据集。 |
| stage3-p480-1200 | 1208 | 1.16 GiB | IMG+TXT | 3阶段裁剪数据集,区域不小于480x480像素。 |
标签聚类结果
原始文本版本
| # | 样本数量 | 标签 |
|---|---|---|
| 0 | 16 | 1girl, smile, solo, looking_at_viewer, white_dress, blush, flower, wedding_dress, bouquet, cleavage, elbow_gloves, jewelry, bare_shoulders, bridal_veil, garter_straps, open_mouth, thighhighs, tiara, white_gloves |
| 1 | 11 | 1girl, looking_at_viewer, smile, solo, blush, skirt, hair_flower, navel, twin_braids, aqua_eyes, black_thighhighs, card, cleavage, dated, earrings, frills, holding, ribbon, very_long_hair |
| 2 | 14 | 1girl, blue_skirt, blush, looking_at_viewer, otonokizaka_school_uniform, pleated_skirt, solo, white_shirt, plaid_skirt, collared_shirt, hair_scrunchie, green_bowtie, striped_bowtie, pink_scrunchie, simple_background, smile, short_sleeves, summer_uniform, white_background, miniskirt, sweater_vest, black_thighhighs, long_sleeves, winter_uniform, zettai_ryouiki |
| 3 | 8 | 1girl, blazer, otonokizaka_school_uniform, solo, striped_bowtie, upper_body, winter_uniform, smile, blush, looking_at_viewer, pink_scrunchie, green_bowtie, long_sleeves, blue_jacket, collared_shirt, white_shirt |
| 4 | 40 | 1girl, solo, hair_flower, looking_at_viewer, crown, dress, smile, single_braid, vines, bare_shoulders, hair_over_shoulder, very_long_hair, thighhighs |
| 5 | 6 | 1girl, braid, solo, blush, looking_at_viewer, smile, hair_over_shoulder, mini_top_hat, open_mouth, skirt |
| 6 | 7 | 1girl, blush, looking_at_viewer, smile, solo, frills, white_gloves, aqua_eyes, earrings, parted_bangs, choker, idol, purple_dress, scrunchie, sparkle, bow, collarbone, maid_headdress, skirt, stage, star_(symbol) |
| 7 | 6 | 1girl, cleavage, solo, heart, looking_at_viewer, maid_headdress, blush, smile, thighhighs, very_long_hair |
| 8 | 12 | 1girl, solo, cloud, day, looking_at_viewer, ocean, outdoors, navel, smile, beach, blush, cleavage, frilled_bikini, hair_flower, open_mouth, blue_sky, bracelet, collarbone |
| 9 | 7 | 1girl, hakama_skirt, looking_at_viewer, miko, red_hakama, solo, smile, blush, wide_sleeves, outdoors, aqua_eyes, holding_broom, kimono, open_mouth, ribbon, shrine |
| 10 | 5 | 1girl, elbow_gloves, witch_hat, black_gloves, black_thighhighs, solo, high_heels, looking_at_viewer, smile, star_hair_ornament, aqua_eyes, capelet, card, cleavage, dress, halloween, ribbon, skirt |
| 11 | 6 | 1girl, kimono, looking_at_viewer, smile, solo, aqua_eyes, floral_print, obi, blush, wide_sleeves, alternate_hairstyle, hair_bow, hair_flower |
| 12 | 5 | 1girl, blush, navel, solo, cleavage, looking_at_viewer, underwear_only, very_long_hair, pink_bra, pink_panties, smile, thigh_gap, white_bra |
搜集汇总
数据集介绍

构建方式
该数据集聚焦于《Love Live!》中的角色东条希,共收录500张图像及其对应的标签。图像采集自Danbooru、Pixiv、Zerochan等多个平台,借助DeepGHS团队开发的自动化爬取系统完成。数据集对核心标签(如紫色长发、绿色眼眸、双马尾等)进行了精简处理,并提供了多种预处理版本,包括原始数据(含元信息)以及短边不超过800或1200像素的标准化版本。此外,还包含经过三阶段裁剪、面积不低于480×480像素的增强数据集,以满足不同训练需求。
特点
数据集具有鲜明的层次化与结构化特征。除了基础的图像-标签对形式,还提供了标签聚类结果,通过可视化样例和表格形式展示了不同服饰、场景下的标签组合,例如婚纱、校服、泳装等主题簇。聚类结果有助于挖掘角色在不同装扮下的视觉模式,为文本到图像生成任务提供细粒度的语义指导。数据集规模虽小(不足千张),但覆盖了多种姿态、背景和着装风格,兼具多样性与针对性。
使用方法
数据集支持通过Waifuc库进行加载。用户可从HuggingFace下载原始压缩包,解压后利用LocalSource读取图像及其元信息(如文件名、标签)。对于需要标准化尺寸的用例,可直接使用800或1200像素的打包版本。若需关注局部细节,推荐采用三阶段裁剪后的数据集。此外,聚类结果可直接用于分析角色常见视觉特征,辅助模型训练时的标签权重调整或数据增强策略设计。
背景与挑战
背景概述
在文本到图像生成领域,高质量、精细标注的动漫角色数据集对于推动模型生成特定角色形象的能力至关重要。CyberHarem团队于近年构建了toujou_nozomi_lovelive数据集,专注于收录《Love Live!》中人气角色东条希的视觉素材。该数据集由DeepGHS团队主导开发,旨在为二次元角色生成任务提供标准化训练资源。核心研究问题聚焦于如何通过多源爬取(涵盖danbooru、pixiv、zerochan等平台)与自动标注技术,构建包含500张图像及其标签的紧凑型数据集。该数据集的影响力体现在为动漫角色定制化生成、风格迁移及细粒度标签学习提供了基准,尤其服务于waifuc等框架下的模型微调与评估。
当前挑战
当前数据集面临的核心挑战包括:其一,领域问题层面,动漫角色生成任务中,模型需从有限样本(仅500张图像)中泛化出角色在不同着装、场景与姿态下的稳定特征表达,这对数据多样性与标注粒度提出了极高要求;其二,构建过程中,多源爬取导致图像分辨率、构图及标签一致性参差不齐,需通过多阶段裁剪(如stage3-p480-800)与元信息对齐来缓解,但标签冗余(如核心标签经剪枝后仍存在长尾分布)与缺失问题仍难以彻底解决;其三,数据集的规模限制使其难以覆盖角色全部服饰变体(如仅12个聚类),可能制约模型对罕见装扮的生成能力。
常用场景
经典使用场景
该数据集以《Love Live!》中的角色東條希为核心,收录了500张高质量图像及其精细标注的标签,涵盖了从校园制服到舞台表演、巫女装束等多种风格。其经典使用场景在于为二次元角色驱动的文本到图像生成模型提供细粒度的训练样本,尤其适用于需要精准复现角色标志性特征(如紫色长发、双马尾、绿色眼眸)的生成任务。研究者可借助该数据集微调Stable Diffusion等扩散模型,使其在保持角色辨识度的前提下,依据文本描述生成多样化的姿态与情境。
实际应用
在实际应用层面,该数据集可直接服务于虚拟偶像的数字化创作与二次元内容生产。例如,游戏开发团队可利用其训练模型,快速生成東條希在不同活动中的宣传素材,降低美术成本;同人创作者则能通过文本描述获得高保真的角色插画,提升创作效率。此外,该数据集还可用于构建动漫角色数据库,支持基于内容的图像检索或个性化头像生成系统,在娱乐产业中具有广泛的应用潜力。
衍生相关工作
该数据集衍生了一系列经典工作,主要集中在角色定制化生成与数据增强策略上。例如,基于此数据集的研究探索了利用LoRA(Low-Rank Adaptation)技术对预训练模型进行轻量级微调,以实现特定角色的高效生成;同时,其提供的多分辨率版本(如800像素、1200像素)和裁剪策略,启发后续工作对数据集预处理流程进行优化,以平衡计算资源与生成质量。此外,该数据集的标签聚类结果还被用于挖掘角色常见服饰搭配,推动了风格迁移与纹理合成领域的交叉研究。
以上内容由遇见数据集搜集并总结生成



