five

CyberHarem/homika_pokemon

收藏
Hugging Face2024-01-16 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/CyberHarem/homika_pokemon
下载链接
链接失效反馈
官方服务:
资源简介:
这是一个名为homika (Pokémon)的数据集,包含500张图片及其标签。图片从多个网站(如danbooru、pixiv、zerochan等)爬取,并由DeepGHS团队提供支持。数据集的核心标签包括`white_hair, hair_ornament, blue_eyes, freckles, short_hair`,这些标签在数据集中被修剪。README还提供了数据集的下载链接和加载方法,并展示了标签聚类结果。

This is a dataset named homika (Pokémon) that contains 500 paired images and their corresponding labels. The images were crawled from multiple websites such as danbooru, pixiv, zerochan and others, and supported by the DeepGHS team. The pruned core tags of the dataset include `white_hair, hair_ornament, blue_eyes, freckles, short_hair`. The accompanying README provides the dataset's download link, loading method and showcases the tag clustering results.
提供机构:
CyberHarem
原始信息汇总

数据集概述

基本信息

  • 数据集名称: Dataset of homika (Pokémon)
  • 许可证: MIT
  • 任务类别: text-to-image
  • 标签: art, not-for-all-audiences
  • 数据量: n<1K

数据内容

  • 图像数量: 500
  • 核心标签: white_hair, hair_ornament, blue_eyes, freckles, short_hair

数据包列表

名称 图像数量 大小 类型 描述
raw 500 509.16 MiB Waifuc-Raw 原始数据,包含元信息(最小边对齐到1400像素,如果更大)
800 500 310.13 MiB IMG+TXT 短边不超过800像素的数据集
stage3-p480-800 1103 603.59 MiB IMG+TXT 3阶段裁剪数据集,区域不小于480x480像素
1200 500 453.88 MiB IMG+TXT 短边不超过1200像素的数据集
stage3-p480-1200 1103 813.42 MiB IMG+TXT 3阶段裁剪数据集,区域不小于480x480像素

加载原始数据集

  • 工具: waifuc

  • 代码示例: python import os import zipfile from huggingface_hub import hf_hub_download from waifuc.source import LocalSource

    下载原始归档文件

    zip_file = hf_hub_download( repo_id=CyberHarem/homika_pokemon, repo_type=dataset, filename=dataset-raw.zip, )

    提取文件到指定目录

    dataset_dir = dataset_dir os.makedirs(dataset_dir, exist_ok=True) with zipfile.ZipFile(zip_file, r) as zf: zf.extractall(dataset_dir)

    使用waifuc加载数据集

    source = LocalSource(dataset_dir) for item in source: print(item.image, item.meta[filename], item.meta[tags])

标签聚类结果

  • 样本示例:
    # 样本数量 图像1 图像2 图像3 图像4 图像5 标签
    0 9 样本0 样本1 样本2 样本3 样本4 1girl, bass_guitar, hair_bobbles, striped_dress, topknot, boots, pokemon_(creature), open_mouth, smile
    1 7 样本0 样本1 样本2 样本3 样本4 1girl, bass_guitar, forehead, hair_bobbles, open_mouth, smile, solo, striped_dress, topknot, holding_instrument, looking_at_viewer, boots, teeth, two-tone_dress, strapless_dress
    2 7 样本0 样本1 样本2 样本3 样本4 1girl, bass_guitar, forehead, hair_bobbles, strapless_dress, striped_dress, topknot, holding_instrument, looking_at_viewer, open_mouth, smile, solo, tongue, plectrum, teeth, two-tone_dress, boots, platform_footwear, spiked_hair
    3 10 样本0 样本1 样本2 样本3 样本4 1girl, bass_guitar, boots, hair_bobbles, open_mouth, striped_dress, topknot, forehead, holding_instrument, pokemon_(creature), smile, spiked_hair, teeth, tongue, two-tone_dress, looking_at_viewer, plectrum
    4 5 样本0 样本1 样本2 样本3 样本4 1girl, hair_bobbles, looking_at_viewer, strapless_dress, striped_dress, tongue_out, topknot, bass_guitar, forehead, solo, platform_boots, two-tone_dress
    5 5 样本0 样本1 样本2 样本3 样本4 1girl, bass_guitar, hair_bobbles, holding_poke_ball, striped_dress, topknot, short_dress, solo, poke_ball_(basic), bra_strap, grin, platform_boots, standing
    6 5 样本0 样本1 样本2 样本3 样本4 1girl, forehead, hair_bobbles, pokemon_(creature), smile, striped_dress, topknot, open_mouth
    7 5 样本0 样本1 样本2 样本3 样本4 1girl, blush, hair_bobbles, kemonomimi_mode, topknot, cat_ears, cat_tail, open_mouth, solo, fang, striped
    8 11 样本0 样本1 样本2 样本3 样本4 1girl, hair_bobbles, navel, small_breasts, topknot, solo, blush, looking_at_viewer, nipples, bikini, nude, pussy, smile, full_body, spread_legs
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作