five

vrachit/imagenet-1k-webdataset

收藏
Hugging Face2025-12-26 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/vrachit/imagenet-1k-webdataset
下载链接
链接失效反馈
官方服务:
资源简介:
# ImageNet-1k WebDataset This dataset contains ImageNet-1k in WebDataset format (tar files) for efficient streaming. ## Dataset Structure - **Training**: 129 shards (train-*.tar) - **Validation**: 5 shards (validation-*.tar) - **Total size**: 147.82 GB ## Format Each tar file contains samples with: - `*.jpg`: Image bytes - `*.cls`: Label (class ID as text) ## Usage ```python import webdataset as wds # Training dataset train_url = "train-{000000..000000000}.tar" dataset = wds.WebDataset(train_url).shuffle(1000).decode("rgb") for sample in dataset: image = sample["jpg"] # PIL Image label = int(sample["cls"]) # ... your training code ``` ## Source Converted from ILSVRC/imagenet-1k parquet format. Generated using [MetaDistil ImageNet Pipeline](https://modal.com)
提供机构:
vrachit
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作