five

Supporting data for "“UDE DIATOMS in the Wild 2024”: A new image dataset of freshwater diatoms for training deep learning models"

收藏
DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/102580
下载链接
链接失效反馈
官方服务:
资源简介:
Diatoms are microalgae with finely ornamented microscopic silica shells. Their taxonomic identification by light microscopy is routinely used as part of community ecological research as well as ecological status assessment of aquatic ecosystems, and a need for digitalisation of these methods has long been recognized. Alongside their high taxonomic and morphological diversity, several other factors make diatoms highly challenging for deep learning-based identification using light microscopy images. These include a) an unusually high intra-class variability combined with small between-class differences; b) a rather different visual appearance of specimens depending on their orientation on the microscope slide; and c) the limited availability of diatom experts for accurate taxonomic annotation. <br>We present the largest diatom image dataset thus far, aimed at facilitating the application and benchmarking of innovative deep learning methods to the diatom identification problem on realistic research data, UDE DIATOMS in the Wild 2024. The dataset contains 83,570 images of 611 diatom taxa, 101 of which are represented by at least 100 examples, and 144 by at least 50 examples each. We showcase this dataset in two innovative analyses that address individual aspects of the above challenges using subclustering to deal with visually heterogeneous classes, out-of-distribution sample detection and semi-supervised learning. <br>The problem of image-based identification of diatoms is both important for environmental research, and challenging from the machine learning perspective. By making available the so far largest image data set, accompanied by innovative analyses, this contribution will facilitate addressing these points by the scientific community.
提供机构:
GigaScience Database
创建时间:
2024-09-25
二维码
社区交流群
二维码
科研交流群
商业服务