Scaled and Translated Image Recognition (STIR) Source Data

NIAID Data Ecosystem2026-03-14 收录

下载链接：

https://zenodo.org/record/7351725

下载链接

链接失效反馈

官方服务：

资源简介：

While convolutions are known to be invariant to (discrete) translations, scaling continues to be a challenge and most image recognition networks are not invariant to them. To explore these effects, we have created the Scaled and Translated Image Recognition (STIR) dataset. This dataset contains objects of size \(s \in [17,64]\), each randomly placed in a \(64 \times 64\) pixel image. Original Source Data dota/ (from DOTA v1.5 Google Drive website) train/ DOTA-v1.5_train.zip not unzipped part1.zip not unzipped part2.zip not unzipped part3.zip not unzipped val/ DOTA-v1.5_val.zip not unzipped part1.zip not unzipped fontawesome/ (from Font Awesome 5.15.3 "Free for Desktop") svgs/ unzipped from archive mapillary/ (from Mapillary Traffic Sign Dataset) mtsd_v2_fully_annotated unzipped from archive train.0.zip not unzipped train.1.zip not unzipped train.2.zip not unzipped val.zip not unzipped mnist/ (from Yann LeCun website) t10k-images-idx3-ubyte.gz t10k-labels-idx1-ubyte.gz train-images-idx3-ubyte.gz train-labels-idx1-ubyte.gz License and Attribution When using the original source data for your own research, please respect the individual licenses. For attribution in papers, we recommend the following citations which introduce the respective datasets. D. Gandy, J. Otero, E. Emanuel, F. Botsford, J. Lundien, K. Jackson, M. Wilkerson, R. Madole, J. Raphael, T. Chase, G. Taglialatela, B. Talbot, and T. Chase. Font Awesome. https://fontawesome.com/v5/download, Nov. 2022. Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proc. IEEE, 86(11):2278–2324, Nov. 1998. C. Ertler, J. Mislej, T. Ollmann, L. Porzi, G. Neuhold, and Y. Kuang. The Mapillary Traffic Sign Dataset for Detection and Classification on a Global Scale. In 2020 16th Eur. Conf. Comput. Vision (ECCV), Glasgow, UK, Aug. 2020. G.-S. Xia, X. Bai, J. Ding, Z. Zhu, S. Belongie, J. Luo, M. Datcu, M. Pelillo, and L. Zhang. DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. In 2018 IEEE/CVF Conf. Comput. Vision and Pattern Recognition (CVPR), pages 3974–3983, Salt Lake City, UT, USA, June 2018.

创建时间：

2022-11-24

5,000+

优质数据集

54 个

任务类型

进入经典数据集