five

diffusers-parti-prompts/muse512

收藏
Hugging Face2023-09-18 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/diffusers-parti-prompts/muse512
下载链接
链接失效反馈
官方服务:
资源简介:
--- configs: - config_name: default data_files: - split: train path: data/train-* dataset_info: features: - name: Prompt dtype: string - name: Category dtype: string - name: Challenge dtype: string - name: Note dtype: string - name: images dtype: image - name: model_name dtype: string - name: seed dtype: int64 splits: - name: train num_bytes: 128701081.568 num_examples: 1632 download_size: 127769152 dataset_size: 128701081.568 --- # Dataset Card for "muse_512" ```py ```py from PIL import Image import torch from muse import PipelineMuse, MaskGiTUViT from datasets import Dataset, Features from datasets import Image as ImageFeature from datasets import Value, load_dataset device = "cuda" if torch.cuda.is_available() else "cpu" pipe = PipelineMuse.from_pretrained( transformer_path="valhalla/research-run", text_encoder_path="openMUSE/clip-vit-large-patch14-text-enc", vae_path="openMUSE/vqgan-f16-8192-laion", ).to(device) pipe.transformer = MaskGiTUViT.from_pretrained("valhalla/research-run-finetuned-journeydb", revision="06bcd6ab6580a2ed3275ddfc17f463b8574457da", subfolder="ema_model").to(device) pipe.tokenizer.pad_token_id = 49407 if device == "cuda": pipe.transformer.enable_xformers_memory_efficient_attention() pipe.text_encoder.to(torch.float16) pipe.transformer.to(torch.float16) import PIL def main(): print("Loading dataset...") parti_prompts = load_dataset("nateraw/parti-prompts", split="train") print("Loading pipeline...") seed = 0 device = "cuda" torch.manual_seed(0) ckpt_id = "openMUSE/muse-512" scale = 10 print("Running inference...") main_dict = {} for i in range(len(parti_prompts)): sample = parti_prompts[i] prompt = sample["Prompt"] image = pipe( prompt, timesteps=16, negative_text=None, guidance_scale=scale, temperature=(2, 0), orig_size=(512, 512), crop_coords=(0, 0), aesthetic_score=6, use_fp16=device == "cuda", transformer_seq_len=1024, use_tqdm=False, )[0] image = image.resize((256, 256), resample=PIL.Image.Resampling.LANCZOS) img_path = f"/home/patrick/muse_images/muse_512_{i}.png" image.save(img_path) main_dict.update( { prompt: { "img_path": img_path, "Category": sample["Category"], "Challenge": sample["Challenge"], "Note": sample["Note"], "model_name": ckpt_id, "seed": seed, } } ) def generation_fn(): for prompt in main_dict: prompt_entry = main_dict[prompt] yield { "Prompt": prompt, "Category": prompt_entry["Category"], "Challenge": prompt_entry["Challenge"], "Note": prompt_entry["Note"], "images": {"path": prompt_entry["img_path"]}, "model_name": prompt_entry["model_name"], "seed": prompt_entry["seed"], } print("Preparing HF dataset...") ds = Dataset.from_generator( generation_fn, features=Features( Prompt=Value("string"), Category=Value("string"), Challenge=Value("string"), Note=Value("string"), images=ImageFeature(), model_name=Value("string"), seed=Value("int64"), ), ) ds_id = "diffusers-parti-prompts/muse512" ds.push_to_hub(ds_id) if __name__ == "__main__": main() ```
提供机构:
diffusers-parti-prompts
原始信息汇总

数据集卡片 "muse_512"

数据集配置

  • 配置名称: default
  • 数据文件:
    • 分割: train
    • 路径: data/train-*

数据集信息

  • 特征:

    • 名称: Prompt
      • 数据类型: string
    • 名称: Category
      • 数据类型: string
    • 名称: Challenge
      • 数据类型: string
    • 名称: Note
      • 数据类型: string
    • 名称: images
      • 数据类型: image
    • 名称: model_name
      • 数据类型: string
    • 名称: seed
      • 数据类型: int64
  • 分割:

    • 名称: train
      • 字节数: 128701081.568
      • 样本数: 1632
  • 下载大小: 127769152

  • 数据集大小: 128701081.568

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作