---
license: unknown
size_categories: n<1K
task_categories:
- image-classification
pretty_name: SSB (hard)
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
dataset_info:
features:
- name: image
dtype: image
splits:
- name: train
num_bytes: 64639425.0
num_examples: 208
download_size: 0
dataset_size: 64639425.0
---
# Dataset Card for SSB (hard) for OOD Detection
<!-- Provide a quick summary of the dataset. -->
## Dataset Details
### Dataset Description
<!-- Provide a longer summary of what this dataset is. -->
- **Original Dataset Authors**: Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman
- **OOD Split Authors:** Julian Bitterwolf, Maximilian Müller, Matthias Hein
- **Shared by:** Eduardo Dadalto
- **License:** unknown
### Dataset Sources
<!-- Provide the basic links for the dataset. -->
- **Original Dataset Paper:** http://arxiv.org/abs/2110.06207v2
- **First OOD Application Paper:** http://arxiv.org/abs/2306.00826v1
### Direct Use
<!-- This section describes suitable use cases for the dataset. -->
This dataset is intended to be used as an ouf-of-distribution dataset for image classification benchmarks.
### Out-of-Scope Use
<!-- This section addresses misuse, malicious use, and uses that the dataset will not work well for. -->
This dataset is not annotated.
### Curation Rationale
<!-- Motivation for the creation of this dataset. -->
The goal in curating and sharing this dataset to the HuggingFace Hub is to accelerate research and promote reproducibility in generalized Out-of-Distribution (OOD) detection.
Check the python library [detectors](https://github.com/edadaltocg/detectors) if you are interested in OOD detection.
### Personal and Sensitive Information
<!-- State whether the dataset contains data that might be considered personal, sensitive, or private (e.g., data that reveals addresses, uniquely identifiable names or aliases, racial or ethnic origins, sexual orientations, religious beliefs, political opinions, financial or health data, etc.). If efforts were made to anonymize the data, describe the anonymization process. -->
Please check original paper for details on the dataset.
### Bias, Risks, and Limitations
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
Please check original paper for details on the dataset.
## Citation
<!-- If there is a paper or blog post introducing the dataset, the APA and Bibtex information for that should go in this section. -->
**BibTeX:**
```bibtex
@software{detectors2023,
author = {Eduardo Dadalto},
title = {Detectors: a Python Library for Generalized Out-Of-Distribution Detection},
url = {https://github.com/edadaltocg/detectors},
doi = {https://doi.org/10.5281/zenodo.7883596},
month = {5},
year = {2023}
}
@article{2306.00826v1,
author = {Julian Bitterwolf and Maximilian Müller and Matthias Hein},
title = {In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation},
year = {2023},
month = {6},
note = {ICML 2023. Datasets, code and evaluation data at
https://github.com/j-cb/NINCO},
archiveprefix = {arXiv},
url = {http://arxiv.org/abs/2306.00826v1}
}
```
## Dataset Card Authors
Eduardo Dadalto
## Dataset Card Contact
https://huggingface.co/edadaltocg
license: 未知
size_categories: 样本量小于1000
task_categories:
- 图像分类
pretty_name: SSB(困难版)
configs:
- config_name: 默认配置
data_files:
- split: 训练集
path: data/train-*
dataset_info:
features:
- name: 图像
dtype: 图像
splits:
- name: 训练集
num_bytes: 64639425.0字节
num_examples: 208
download_size: 0字节
dataset_size: 64639425.0字节
# 面向分布外(Out-of-Distribution, OOD)检测的SSB(困难版)数据集卡片
<!-- 请提供该数据集的简要概述。 -->
## 数据集详情
### 数据集描述
<!-- 请提供该数据集的详细说明。 -->
- **原始数据集作者**:萨加尔·瓦泽(Sagar Vaze)、韩凯(Kai Han)、安德里亚·韦达利(Andrea Vedaldi)、安德鲁·齐斯曼(Andrew Zisserman)
- **分布外划分作者**:朱利安·比特沃夫(Julian Bitterwolf)、马克西米利安·穆勒(Maximilian Müller)、马蒂亚斯·海因(Matthias Hein)
- **共享者**:爱德华多·达达托(Eduardo Dadalto)
- **许可证**:未知
### 数据集来源
<!-- 请提供该数据集的基础链接。 -->
- **原始数据集论文**:http://arxiv.org/abs/2110.06207v2
- **首篇分布外检测应用论文**:http://arxiv.org/abs/2306.00826v1
### 直接用途
<!-- 本小节描述该数据集的适用使用场景。 -->
本数据集旨在作为图像分类基准测试的分布外(Out-of-Distribution, OOD)数据集。
### 超范围使用
<!-- 本小节说明该数据集的误用、恶意使用,以及无法适配的使用场景。 -->
本数据集未进行标注。
### 数据集构建初衷
<!-- 创建该数据集的动机说明。 -->
本数据集整理并共享至Hugging Face Hub的目标,是加速通用分布外检测(Out-of-Distribution, OOD)领域的研究,并提升相关研究的可复现性。
若您对分布外检测研究感兴趣,可查阅Python库[detectors](https://github.com/edadaltocg/detectors)。
### 个人与敏感信息说明
<!-- 说明数据集是否包含可被视为个人、敏感或隐私的数据(例如:揭示地址、唯一可识别的姓名或别名、种族或族裔出身、性取向、宗教信仰、政治观点、财务或健康数据等)。若已对数据进行匿名化处理,请说明匿名化流程。 -->
有关数据集的详细信息,请查阅原始论文。
### 偏倚、风险与局限性
<!-- 本小节旨在说明技术与社会技术层面的局限性。 -->
有关数据集的偏倚、风险与局限性细节,请查阅原始论文。
## 引用信息
<!-- 若有介绍该数据集的论文或博客文章,请在此处提供其APA和BibTeX格式引用信息。 -->
**BibTeX:**
bibtex
@software{detectors2023,
author = {Eduardo Dadalto},
title = {Detectors: a Python Library for Generalized Out-Of-Distribution Detection},
url = {https://github.com/edadaltocg/detectors},
doi = {https://doi.org/10.5281/zenodo.7883596},
month = {5},
year = {2023}
}
@article{2306.00826v1,
author = {Julian Bitterwolf and Maximilian Müller and Matthias Hein},
title = {In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation},
year = {2023},
month = {6},
note = {ICML 2023. Datasets, code and evaluation data at
https://github.com/j-cb/NINCO},
archiveprefix = {arXiv},
url = {http://arxiv.org/abs/2306.00826v1}
}
## 数据集卡片作者
爱德华多·达达托(Eduardo Dadalto)
## 数据集卡片联系方式
https://huggingface.co/edadaltocg