Single-Shot Optical Neural Network

Mendeley Data2024-05-17 更新2024-06-27 收录

下载链接：

https://zenodo.org/records/7827147

下载链接

链接失效反馈

官方服务：

资源简介：

Data from classification of MNIST [1], Fashion-MNIST [2] and QuickDraw [3] images reported in "Single-Shot Optical Neural Network" by L. Bernstein et al. The networks of 784 -> N (-> N) -> 10 activations performed inference on the test sets through consecutive matrix products implemented on the optical hardware, with ReLU applied electronically between each layer (see main text for more details). Folders for each tested network contain the following text files: Inputs: 2D matrices of size B x 784 containing training (B = 50,000 or 100,000), validation (B = 10,000) and test (B = 10,000) sets. Each row is an input vector that can be reshaped into an input image of size 28 x 28. True labels: B-length vectors containing the true label of each input in the training, validation and test sets. Neural network weights: 2D matrices of size K x N used in inference experiments to classify the test sets. Weights were pre-trained on the training set using a digital electronic computer as described in Materials and Methods. Weight values were normalized such that all values fall between -255 and 255. In the optical neural network, the weighting SLM displays the absolute values of the weights (rounded to the nearest integer), and the negative weight signs are applied in post-processing. The 32-bit weight values were used for inference performed on the digital electronic computer for the ground truth comparison. Outputs (normalized): 2D output matrices of size 10,000 x 10 from the networks processed on a digital electronic computer (ground truth) and the optical neural network. Each row is an output vector where the position of the maximum value indicates the predicted label of the input in the same row of the test set. Predicted labels: 10,000-element vectors that represent the labels predicted by the networks processed on a digital electronic computer (ground truth) and the optical neural network. These predicted labels were used to generate the confusion matrices and calculate the classification accuracies (versus the true labels). The classes for the Fashion-MNIST dataset are the following: 0: T-shirt 1: Trouser 2: Pullover 3: Dress 4: Coat 5: Sandal 6: Shirt 7: Sneaker 8: Bag 9: Ankle boot And the randomly selected classes for QuickDraw are: 0: Hourglass 1: Saw 2: Golf club 3: See saw 4: Spoon 5: Horse 6: Onion 7: Light bulb 8: Harp 9: Flip flops [1] Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition. Proceedings of the IEEE 86, 2278–2324 (1998). [2] H. Xiao, K. Rasul, R. Vollgraf, Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. Preprint at https://arxiv.org/abs/1708.07747 (2017). [3] J. Jongejan, H. Rowley, T. Kawashima, J. Kim, N. Fox-Gieg, The Quick, Draw! AI experiment, https://quickdraw.withgoogle.com/ (2016).

创建时间：

2023-06-28

5,000+

优质数据集

54 个

任务类型

进入经典数据集