five

BINDER: On Identification and Retrieval of Near-Duplicate Biological Images: a New Dataset and Protocol

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://doi.org/10.7910/DVN/9YTD9Z
下载链接
链接失效反馈
官方服务:
资源简介:
Abstract: Manipulation and re-use of images in scientific publications is a growing issue, not only for biomedical publishers but also for the research community in general. In this work, we introduce BINDER – Bio-Image Near-Duplicate Examples Repository, a novel dataset to help researchers develop, train, and test models to detect same-source biomedical images. BINDER contains 7,490 unique image patches – the training set – as well as 1,821 patches – split in validation and test sets – with accompanying manipulations obtained by means of transformations including rotation, translation, scale, perspective transform, contrast adjustment, and/or compression artifacts. In addition, we show how novel adaptations of existing image retrieval and metric learning models, when trained on this dataset, can be applied to achieve high-accuracy inference results, creating a baseline for future work. In aggregate, we thus present a supervised protocol for near-duplicate image identification and retrieval without any “real-world” training example. This is part of the Image Forensics project: Image Forensics
创建时间:
2021-06-16
二维码
社区交流群
二维码
科研交流群
商业服务