Recod.ai Scientific Image Integrity Dataset (RSIID)
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/records/15095089
下载链接
链接失效反馈官方服务:
资源简介:
The Recod.ai Scientific Image Integrity Dataset (RSIID) is a benchmark dataset designed for evaluating forgery detection methods in scientific images. This dataset comprises 39,423 synthetically tampered figures, derived from 2,923 pristine scientific images sourced from Creative Commons repositories. The dataset is divided into training (26,496 figures) and testing (12,927 figures) sets, all licensed under Creative Commons Attribution (CC-BY).
The RSIID is structured by forgery modality and figure complexity, categorized into "Simple" and "Compound" figures:
Simple Scientific Figures: These include forgeries created through Retouching (Blurring, Contrast adjustments), Cleaning (Inpainting, Brute-force removal), and Duplication (Copy-Move, Splicing, Overlap).
Compound Scientific Figures: These figures consist of multiple panels, where forgeries can occur within a single panel (intra-forgery) or between panels (inter-forgery).
Additional Resources:
This repository also includes:
Source Images and Metadata: The original, untampered images used to create the dataset, along with a spreadsheet (artificial_forgery_src_data.zip) detailing the source of each image.
Compound Forgery Templates: Templates used for creating the compound forgeries (template.zip).
Related Content:
Research Article - Benchmarking Scientific Image Forgery Detectors
GitHub Repository - Recod.ai Scientific Image Integrity Library
Citation
The dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.
If you use any content from this repository, please cite:
@article{cardenuto_2022,
title={Benchmarking scientific image forgery detectors},
volume={28}, DOI={10.1007/s11948-022-00391-4},
number={4},
journal={Science and Engineering Ethics},
author={Cardenuto, João P. and Rocha, Anderson}, year={2022}
}
创建时间:
2025-03-27



