Scaled and Translated Image Recognition (STIR) Source Data
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7351725
下载链接
链接失效反馈官方服务:
资源简介:
While convolutions are known to be invariant to (discrete) translations, scaling continues to be a challenge and most image recognition networks are not invariant to them. To explore these effects, we have created the Scaled and Translated Image Recognition (STIR) dataset. This dataset contains objects of size \(s \in [17,64]\), each randomly placed in a \(64 \times 64\) pixel image.
Original Source Data
dota/ (from DOTA v1.5 Google Drive website)
train/
DOTA-v1.5_train.zip not unzipped
part1.zip not unzipped
part2.zip not unzipped
part3.zip not unzipped
val/
DOTA-v1.5_val.zip not unzipped
part1.zip not unzipped
fontawesome/ (from Font Awesome 5.15.3 "Free for Desktop")
svgs/ unzipped from archive
mapillary/ (from Mapillary Traffic Sign Dataset)
mtsd_v2_fully_annotated unzipped from archive
train.0.zip not unzipped
train.1.zip not unzipped
train.2.zip not unzipped
val.zip not unzipped
mnist/ (from Yann LeCun website)
t10k-images-idx3-ubyte.gz
t10k-labels-idx1-ubyte.gz
train-images-idx3-ubyte.gz
train-labels-idx1-ubyte.gz
License and Attribution
When using the original source data for your own research, please respect the individual licenses. For attribution in papers, we recommend the following citations which introduce the respective datasets.
D. Gandy, J. Otero, E. Emanuel, F. Botsford, J. Lundien, K. Jackson, M. Wilkerson, R. Madole, J. Raphael, T. Chase, G. Taglialatela, B. Talbot, and T. Chase. Font Awesome. https://fontawesome.com/v5/download, Nov. 2022.
Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proc. IEEE, 86(11):2278–2324, Nov. 1998.
C. Ertler, J. Mislej, T. Ollmann, L. Porzi, G. Neuhold, and Y. Kuang. The Mapillary Traffic Sign Dataset for Detection and Classification on a Global Scale. In 2020 16th Eur. Conf. Comput. Vision (ECCV), Glasgow, UK, Aug. 2020.
G.-S. Xia, X. Bai, J. Ding, Z. Zhu, S. Belongie, J. Luo, M. Datcu, M. Pelillo, and L. Zhang. DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. In 2018 IEEE/CVF Conf. Comput. Vision and Pattern Recognition (CVPR), pages 3974–3983, Salt Lake City, UT, USA, June 2018.
创建时间:
2022-11-24



