
Scaled and Translated Image Recognition (STIR) Source Data

Source: NIAID Data Ecosystem (indexed 2026-03-14)
Download link:
https://zenodo.org/record/7351725
Description:
While convolutions are known to be invariant to (discrete) translations, scaling continues to be a challenge, and most image recognition networks are not invariant to it. To explore these effects, we have created the Scaled and Translated Image Recognition (STIR) dataset. This dataset contains objects of size \(s \in [17,64]\), each randomly placed in a \(64 \times 64\) pixel image.

Original Source Data

dota/ (from DOTA v1.5 Google Drive website)
    train/
        DOTA-v1.5_train.zip (not unzipped)
        part1.zip (not unzipped)
        part2.zip (not unzipped)
        part3.zip (not unzipped)
    val/
        DOTA-v1.5_val.zip (not unzipped)
        part1.zip (not unzipped)
fontawesome/ (from Font Awesome 5.15.3 "Free for Desktop")
    svgs/ (unzipped from archive)
mapillary/ (from Mapillary Traffic Sign Dataset)
    mtsd_v2_fully_annotated (unzipped from archive)
    train.0.zip (not unzipped)
    train.1.zip (not unzipped)
    train.2.zip (not unzipped)
    val.zip (not unzipped)
mnist/ (from Yann LeCun website)
    t10k-images-idx3-ubyte.gz
    t10k-labels-idx1-ubyte.gz
    train-images-idx3-ubyte.gz
    train-labels-idx1-ubyte.gz

License and Attribution

When using the original source data for your own research, please respect the individual licenses. For attribution in papers, we recommend the following citations, which introduce the respective datasets.

D. Gandy, J. Otero, E. Emanuel, F. Botsford, J. Lundien, K. Jackson, M. Wilkerson, R. Madole, J. Raphael, T. Chase, G. Taglialatela, B. Talbot, and T. Chase. Font Awesome. https://fontawesome.com/v5/download, Nov. 2022.

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proc. IEEE, 86(11):2278–2324, Nov. 1998.

C. Ertler, J. Mislej, T. Ollmann, L. Porzi, G. Neuhold, and Y. Kuang. The Mapillary Traffic Sign Dataset for Detection and Classification on a Global Scale. In 2020 16th Eur. Conf. Comput. Vision (ECCV), Glasgow, UK, Aug. 2020.

G.-S. Xia, X. Bai, J. Ding, Z. Zhu, S. Belongie, J. Luo, M. Datcu, M. Pelillo, and L. Zhang. DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. In 2018 IEEE/CVF Conf. Comput. Vision and Pattern Recognition (CVPR), pages 3974–3983, Salt Lake City, UT, USA, June 2018.
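
The description above says only that each object has size \(s \in [17,64]\) and is placed at a random position in a \(64 \times 64\) image. As a rough illustration of that idea, the following Python sketch draws a size and a position and pastes a resized object onto an empty canvas. The uniform sampling, the nearest-neighbour resize, the zero background, and the names `resize_nearest` and `place_object` are all assumptions made for this example; they are not taken from the STIR generation code.

```python
import numpy as np

CANVAS = 64              # image side length, from the description
S_MIN, S_MAX = 17, 64    # object size range, from the description


def resize_nearest(obj: np.ndarray, s: int) -> np.ndarray:
    """Resize a 2-D array to s x s using nearest-neighbour sampling (assumed)."""
    rows = np.arange(s) * obj.shape[0] // s
    cols = np.arange(s) * obj.shape[1] // s
    return obj[np.ix_(rows, cols)]


def place_object(obj: np.ndarray, rng: np.random.Generator):
    """Scale obj to a random size s and paste it at a random valid offset."""
    s = int(rng.integers(S_MIN, S_MAX + 1))       # upper bound is exclusive
    top = int(rng.integers(0, CANVAS - s + 1))    # keep the object inside
    left = int(rng.integers(0, CANVAS - s + 1))
    canvas = np.zeros((CANVAS, CANVAS), dtype=obj.dtype)  # assumed background
    canvas[top:top + s, left:left + s] = resize_nearest(obj, s)
    return canvas, s, (top, left)


rng = np.random.default_rng(0)
digit = np.ones((28, 28), dtype=np.uint8)  # stand-in for one MNIST-sized object
image, s, (top, left) = place_object(digit, rng)
print(image.shape, s, (top, left))
```

Note that when \(s = 64\) the object fills the whole canvas, so the only valid offset is `(0, 0)`; smaller objects have more room to translate.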
Created:
2022-11-24