five

Label Memorization of CIFAR-10N

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14687825
下载链接
链接失效反馈
官方服务:
资源简介:
These files contain results from running heldout estimation on the CIFAR-10N dataset with the rand1 noise type, using 1500 ResNet models. The cifar-rand1-human-infl-mem.npz file contains results for human noisy labels, while cifar-rand1-syn-infl-mem.npz contains results for synthetic noisy labels generated using the same class transition matrix.   Dictionary Keys 1. total_runs Type: Integer Description: The total number of training runs included in this aggregated result. 2. trainset_mask Type: array (Boolean, shape: [total_runs, train_size]) Description: A mask indicating which training examples were used (True) or held out (False) during each training run. 3. trainset_correctness Key: trainset_correctness Type: array (Boolean, shape: [total_runs, train_size]) Description: Whether the model correctly predicted the label for each training example during each run. 4. trainset_predictions Type: array (Integer, shape: [total_runs, train_size]) Description: The predicted class labels for each training example during each run. 5. testset_correctness Type: array (Boolean, shape: [total_runs, test_size]) Description: Whether the model correctly predicted the label for each test example during each run. 6. testset_predictions Type: array (Integer, shape: [total_runs, test_size]) Description: The predicted class labels for each test example during each run. 7. memorization Type: array (Float, shape: [train_size]) Description: Memorization score of each training example, computed across all runs. 8. influence Type: array (Float, shape: [test_size, train_size]) Description: Influence scores of each training example on each test example. 9. memorization_inclusion_prob Type: array (Float, shape: [train_size]) Description: Probability that the training example is predicted correctly when included. 10. memorization_exclusion_prob Type: array (Float, shape: [train_size]) Description: Probability that the training example is predicted correctly when excluded. Usage To load the file and access its contents: import numpy as np # Load the file data = np.load('cifar-rand1-human-agg-infl-mem.npz') # Access individual components total_runs = data['total_runs'] trainset_mask = data['trainset_mask'] memorization = data['memorization'] Notes Our results were aggregated over 1500 training runs. For more details or questions, feel free to reach out!
创建时间:
2025-03-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作