Dataset for reproducing BadMIM
收藏Figshare2025-11-19 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Dataset_for_reproducing_BadMIM/29979010/1
下载链接
链接失效反馈官方服务:
资源简介:
This dataset includes all necessary data to reproduct the proformance of BadMIM on CIFAR10.Specifically, for training auxiliary models, auxiliary datasets are provided for each target class in CIFAR10, which are constructed using Caltech256 and web-sourced target-class images. Notably, the "dog" class already exists in Caltech256, so the auxiliary dataset for attacking the "dog" class contains only Caltech256 images. For trigger augmentation (Adversarial Trigger Augmentation), ten triggers corresponding to each target class were adversarially augmented using their respective auxiliary models and datasets to produce the final augmented triggers.For backdoor injection (Reconstruction Hijacking), a shadow dataset (Imagenette2) and the augmented triggers are used to train a backdoored MIM encoder. This encoder is then fine-tuned on downstream tasks, such as CIFAR10.In summary, this collection contains three types of datasets:<br>1) Auxiliary datasets: Caltech256 combined with web-sourced target-class images;<br>2) Shadow dataset: Imagenette2, a small subset of ImageNet;<br>3) Downstream dataset: Only CIFAR10 is provided here.<br>4) Triggers: Original triggers and augmented triggers.
提供机构:
Li, Yang
创建时间:
2025-11-19



