five

pokemon-blip-captions

收藏
魔搭社区2026-05-07 更新2024-05-15 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/pokemon-blip-captions
下载链接
链接失效反馈
官方服务:
资源简介:
# Dataset Card for Pokémon BLIP captions _Dataset used to train [Pokémon text to image model](https://github.com/LambdaLabsML/examples/tree/main/stable-diffusion-finetuning)_ BLIP generated captions for Pokémon images from Few Shot Pokémon dataset introduced by _Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis_ (FastGAN). Original images were obtained from [FastGAN-pytorch](https://github.com/odegeasslbc/FastGAN-pytorch) and captioned with the [pre-trained BLIP model](https://github.com/salesforce/BLIP). For each row the dataset contains `image` and `text` keys. `image` is a varying size PIL jpeg, and `text` is the accompanying text caption. Only a train split is provided. ## 下载数据集 ``` git clone http://oauth2:ZbY47KP-R6z84fEufZfY@www.modelscope.cn/datasets/AI-ModelScope/pokemon-blip-captions.git ``` ## 示例代码 ```python from modelscope import MsDataset from modelscope.utils.constant import DownloadMode ds = MsDataset.load('AI-ModelScope/pokemon-blip-captions',subset_name='default', split='train', download_mode=DownloadMode.FORCE_REDOWNLOAD, cache_dir='/mnt/workspace/cache_yk') print(next(iter(ds))) ``` ## Examples ![pk1.jpg](https://s3.amazonaws.com/moonup/production/uploads/1663756580442-62bd5f951e22ec84279820e8.jpeg) > a drawing of a green pokemon with red eyes ![pk10.jpg](https://s3.amazonaws.com/moonup/production/uploads/1663756580225-62bd5f951e22ec84279820e8.jpeg) > a green and yellow toy with a red nose ![pk100.jpg](https://s3.amazonaws.com/moonup/production/uploads/1663756579985-62bd5f951e22ec84279820e8.jpeg) > a red and white ball with an angry look on its face

# 宝可梦BLIP标注数据集卡片 _本数据集用于训练[宝可梦文本到图像模型](https://github.com/LambdaLabsML/examples/tree/main/stable-diffusion-finetuning)_ BLIP生成的标注文本来自由《Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis》(FastGAN)提出的少样本宝可梦数据集。原始图像取自[FastGAN-pytorch](https://github.com/odegeasslbc/FastGAN-pytorch),并通过[预训练BLIP模型](https://github.com/salesforce/BLIP)生成标注。 数据集中每一条目均包含`image`与`text`两个键:`image`为尺寸可变的PIL格式JPEG图像,`text`为对应的文本标注。本数据集仅提供训练拆分。 ## 数据集下载 git clone http://oauth2:ZbY47KP-R6z84fEufZfY@www.modelscope.cn/datasets/AI-ModelScope/pokemon-blip-captions.git ## 示例代码 python from modelscope import MsDataset from modelscope.utils.constant import DownloadMode ds = MsDataset.load('AI-ModelScope/pokemon-blip-captions',subset_name='default', split='train', download_mode=DownloadMode.FORCE_REDOWNLOAD, cache_dir='/mnt/workspace/cache_yk') print(next(iter(ds))) ## 示例 ![pk1.jpg](https://s3.amazonaws.com/moonup/production/uploads/1663756580442-62bd5f951e22ec84279820e8.jpeg) > 一幅带有红色眼睛的绿色宝可梦绘图 ![pk10.jpg](https://s3.amazonaws.com/moonup/production/uploads/1663756580225-62bd5f951e22ec84279820e8.jpeg) > 一个带有红色鼻子的绿黄色玩具 ![pk100.jpg](https://s3.amazonaws.com/moonup/production/uploads/1663756579985-62bd5f951e22ec84279820e8.jpeg) > 一个表面带有愤怒表情的红白配色球体
提供机构:
maas
创建时间:
2023-12-13
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集用于训练宝可梦文本到图像模型,基于Few Shot Pokémon数据集,通过BLIP模型自动生成图像描述。数据集包含图像和对应的文本描述键值对,仅提供训练分割,适用于图像生成任务的训练。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作