imagenet-1k-wds
收藏魔搭社区2026-05-08 更新2025-01-11 收录
下载链接:
https://modelscope.cn/datasets/timm/imagenet-1k-wds
下载链接
链接失效反馈官方服务:
资源简介:
## Dataset Description
- **Homepage:** https://image-net.org/index.php
- **Paper:** https://arxiv.org/abs/1409.0575
### Dataset Summary
ILSVRC 2012, commonly known as 'ImageNet' is an image dataset organized according to the WordNet hierarchy. Each meaningful concept in WordNet, possibly described by multiple words or word phrases, is called a "synonym set" or "synset". There are more than 100,000 synsets in WordNet, majority of them are nouns (80,000+). ImageNet aims to provide on average 1000 images to illustrate each synset. Images of each concept are quality-controlled and human-annotated.
💡 This dataset provides access to ImageNet (ILSVRC) 2012 which is the most commonly used **subset** of ImageNet. This dataset spans 1000 object classes and contains 1,281,167 training images, 50,000 validation images and 100,000 test images. The version also has the [patch](https://drive.google.com/file/d/16RYnHpVOW0XKCsn3G3S9GTHUyoV2-4WX/view) which fixes some of the corrupted test set images already applied. For full ImageNet dataset presented in [[2]](https://ieeexplore.ieee.org/abstract/document/5206848), please check the download section of the [main website](https://image-net.org/download-images.php).
### Data Splits
Unlike ImageNet-1k (ILSVRC 2012), the full ImageNet dataset has no defined splits. This subset includes a validation split consiting of 40 samples per 11821 classes.
#### Train
* `imagenet1k-train-{0000..1023}.tar`
* 1281167 samples over 1024 shards
#### Validation
* `imagenet1k-validation-{0000..0063}.tar`
* 50000 samples over 63 shards
### Processing
The webdataset shards were converted from TFDS shards matching the splits in TFDS ImageNet-1k.
## Additional Information
### Dataset Curators
Authors of [[1]](https://arxiv.org/abs/1409.0575) and [[2]](https://ieeexplore.ieee.org/abstract/document/5206848):
- Olga Russakovsky
- Jia Deng
- Hao Su
- Jonathan Krause
- Sanjeev Satheesh
- Wei Dong
- Richard Socher
- Li-Jia Li
- Kai Li
- Sean Ma
- Zhiheng Huang
- Andrej Karpathy
- Aditya Khosla
- Michael Bernstein
- Alexander C Berg
- Li Fei-Fei
### Licensing Information
In exchange for permission to use the ImageNet database (the "Database") at Princeton University and Stanford University, Researcher hereby agrees to the following terms and conditions:
1. Researcher shall use the Database only for non-commercial research and educational purposes.
1. Princeton University and Stanford University make no representations or warranties regarding the Database, including but not limited to warranties of non-infringement or fitness for a particular purpose.
1. Researcher accepts full responsibility for his or her use of the Database and shall defend and indemnify the ImageNet team, Princeton University, and Stanford University, including their employees, Trustees, officers and agents, against any and all claims arising from Researcher's use of the Database, including but not limited to Researcher's use of any copies of copyrighted images that he or she may create from the Database.
1. Researcher may provide research associates and colleagues with access to the Database provided that they first agree to be bound by these terms and conditions.
1. Princeton University and Stanford University reserve the right to terminate Researcher's access to the Database at any time.
1. If Researcher is employed by a for-profit, commercial entity, Researcher's employer shall also be bound by these terms and conditions, and Researcher hereby represents that he or she is fully authorized to enter into this agreement on behalf of such employer.
1. The law of the State of New Jersey shall apply to all disputes under this agreement.
### Citation Information
```bibtex
@article{imagenet15russakovsky,
Author = {Olga Russakovsky and Jia Deng and Hao Su and Jonathan Krause and Sanjeev Satheesh and Sean Ma and Zhiheng Huang and Andrej Karpathy and Aditya Khosla and Michael Bernstein and Alexander C. Berg and Li Fei-Fei},
Title = { {ImageNet Large Scale Visual Recognition Challenge} },
Year = {2015},
journal = {International Journal of Computer Vision (IJCV)},
doi = {10.1007/s11263-015-0816-y},
volume={115},
number={3},
pages={211-252}
}
```
## 数据集说明
- **主页:** https://image-net.org/index.php
- **论文:** https://arxiv.org/abs/1409.0575
### 数据集概述
ILSVRC 2012,通常被称为「ImageNet」,是一款依据**词网(WordNet)**层级结构组织的图像数据集。词网中每个具备语义的概念(可由多个单词或词组描述)被称为「同义词集(synonym set,简称synset)」。词网内共包含超过10万个同义词集,其中绝大多数为名词(超8万个)。ImageNet旨在为每个同义词集平均提供1000张图像,每个概念对应的图像均经过质量管控与人工标注。
💡 本数据集提供的是最为常用的ImageNet子集——ImageNet(ILSVRC)2012。该数据集涵盖1000个物体类别,包含1281167张训练图像、50000张验证图像与100000张测试图像。此版本还附带了[补丁](https://drive.google.com/file/d/16RYnHpVOW0XKCsn3G3S9GTHUyoV2-4WX/view),该补丁已完成部分损坏测试集图像的修复工作。如需获取[[2]](https://ieeexplore.ieee.org/abstract/document/5206848)中提及的完整ImageNet数据集,请访问[主网站](https://image-net.org/download-images.php)的下载板块。
### 数据划分
与ImageNet-1k(即ILSVRC 2012)不同,完整的ImageNet数据集并未定义固定的数据划分方式。本子集包含的验证划分中,每11821个类别对应40个样本。
#### 训练集
* 文件命名格式:`imagenet1k-train-{0000..1023}.tar`
* 共1024个数据分片,包含1281167个样本
#### 验证集
* 文件命名格式:`imagenet1k-validation-{0000..0063}.tar`
* 共63个数据分片,包含50000个样本
### 数据处理
本数据集的**Web数据集(WebDataset)**分片均由与TFDS(TensorFlow Datasets)ImageNet-1k划分一致的TFDS分片转换得到。
## 补充信息
### 数据集制作者
[[1]](https://arxiv.org/abs/1409.0575)与[[2]](https://ieeexplore.ieee.org/abstract/document/5206848)的作者:
- Olga Russakovsky
- Jia Deng
- Hao Su
- Jonathan Krause
- Sanjeev Satheesh
- Wei Dong
- Richard Socher
- Li-Jia Li
- Kai Li
- Sean Ma
- Zhiheng Huang
- Andrej Karpathy
- Aditya Khosla
- Michael Bernstein
- Alexander C Berg
- Li Fei-Fei
### 授权协议
在普林斯顿大学与斯坦福大学许可使用ImageNet数据库(下称「本数据库」)的前提下,研究人员同意遵守以下条款与条件:
1. 研究人员仅可将本数据库用于非商业性研究与教育用途。
2. 普林斯顿大学与斯坦福大学不对本数据库作出任何陈述或担保,包括但不限于非侵权性或特定用途适用性的担保。
3. 研究人员需对其使用本数据库的行为承担全部责任,并需就因研究人员使用本数据库(包括但不限于研究人员从本数据库制作的任何受版权保护的图像副本的使用)所引发的任何及所有索赔,为ImageNet团队、普林斯顿大学及斯坦福大学(包括其雇员、受托人、管理人员与代理人)进行辩护并承担赔偿责任。
4. 研究人员可向研究助手与同事提供本数据库的访问权限,但前提是这些人员需先同意遵守本条款与条件。
5. 普林斯顿大学与斯坦福大学保留随时终止研究人员使用本数据库的权限。
6. 若研究人员受雇于营利性商业实体,则该雇主亦需遵守本条款与条件,且研究人员在此声明其已获得充分授权,可代表该雇主签署本协议。
7. 本协议下的所有争议均适用新泽西州法律。
### 引用信息
bibtex
@article{imagenet15russakovsky,
Author = {Olga Russakovsky and Jia Deng and Hao Su and Jonathan Krause and Sanjeev Satheesh and Sean Ma and Zhiheng Huang and Andrej Karpathy and Aditya Khosla and Michael Bernstein and Alexander C. Berg and Li Fei-Fei},
Title = { {ImageNet Large Scale Visual Recognition Challenge} },
Year = {2015},
journal = {International Journal of Computer Vision (IJCV)},
doi = {10.1007/s11263-015-0816-y},
volume={115},
number={3},
pages={211-252}
}
提供机构:
maas
创建时间:
2025-01-08



