five

Birchlabs/wds-dataset-viewer-test

收藏
Hugging Face2023-10-22 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Birchlabs/wds-dataset-viewer-test
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 pretty_name: OpenAI guided-diffusion 256px class-conditional unguided samples (20 samples) size_categories: - n<1K --- Read from the webdataset (after saving it somewhere on your disk) like this: ```python from webdataset import WebDataset from typing import TypedDict, Iterable from PIL import Image from PIL.PngImagePlugin import PngImageFile from io import BytesIO from os import makedirs Example = TypedDict('Example', { '__key__': str, '__url__': str, 'img.png': bytes, }) dataset = WebDataset('./wds-dataset-viewer-test/{00000..00001}.tar') out_root = 'out' makedirs(out_root, exist_ok=True) it: Iterable[Example] = iter(dataset) for ix, item in enumerate(it): with BytesIO(item['img.png']) as stream: img: PngImageFile = Image.open(stream) img.load() img.save(f'{out_root}/{ix}.png') ``` Or from the HF dataset like this: ```python from datasets import load_dataset from datasets.dataset_dict import DatasetDict from datasets.arrow_dataset import Dataset from PIL.PngImagePlugin import PngImageFile from typing import TypedDict, Iterable from os import makedirs class Item(TypedDict): index: int tar: str tar_path: str img: PngImageFile dataset: DatasetDict = load_dataset('Birchlabs/wds-dataset-viewer-test') train: Dataset = dataset['train'] out_root = 'out' makedirs(out_root, exist_ok=True) it: Iterable[Item] = iter(train) for item in it: item['img'].save(f'{out_root}/{item["index"]}.png') ```
提供机构:
Birchlabs
原始信息汇总

数据集概述

数据集名称

OpenAI guided-diffusion 256px class-conditional unguided samples (20 samples)

数据集大小

n<1K

许可证

apache-2.0

数据集读取示例

从WebDataset读取

python from webdataset import WebDataset from typing import TypedDict, Iterable from PIL import Image from PIL.PngImagePlugin import PngImageFile from io import BytesIO from os import makedirs

Example = TypedDict(Example, { key: str, url: str, img.png: bytes, })

dataset = WebDataset(./wds-dataset-viewer-test/{00000..00001}.tar)

out_root = out makedirs(out_root, exist_ok=True)

it: Iterable[Example] = iter(dataset) for ix, item in enumerate(it): with BytesIO(item[img.png]) as stream: img: PngImageFile = Image.open(stream) img.load() img.save(f{out_root}/{ix}.png)

从HF数据集读取

python from datasets import load_dataset from datasets.dataset_dict import DatasetDict from datasets.arrow_dataset import Dataset from PIL.PngImagePlugin import PngImageFile from typing import TypedDict, Iterable from os import makedirs

class Item(TypedDict): index: int tar: str tar_path: str img: PngImageFile

dataset: DatasetDict = load_dataset(Birchlabs/wds-dataset-viewer-test) train: Dataset = dataset[train]

out_root = out makedirs(out_root, exist_ok=True)

it: Iterable[Item] = iter(train) for item in it: item[img].save(f{out_root}/{item["index"]}.png)

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作