five

BINDER: On Identification and Retrieval of Near-Duplicate Biological Images: a New Dataset and Protocol

收藏
DataONE2021-06-16 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:d11232d32d5e8ca656e8427ba0d4c28d10b8c3d90e085850902d42d66fcb5bd9
下载链接
链接失效反馈
官方服务:
资源简介:
Abstract: Manipulation and re-use of images in scientific publications is a growing issue, not only for biomedical publishers but also for the research community in general. In this work, we introduce BINDER – Bio-Image Near-Duplicate Examples Repository, a novel dataset to help researchers develop, train, and test models to detect same-source biomedical images. BINDER contains 7,490 unique image patches – the training set – as well as 1,821 patches – split in validation and test sets – with accompanying manipulations obtained by means of transformations including rotation, translation, scale, perspective transform, contrast adjustment, and/or compression artifacts. In addition, we show how novel adaptations of existing image retrieval and metric learning models, when trained on this dataset, can be applied to achieve high-accuracy inference results, creating a baseline for future work. In aggregate, we thus present a supervised protocol for near-duplicate image identification and retrieval without any “real-world” training example. This is part of the Image Forensics project: Image Forensics
创建时间:
2023-11-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作