SnakeCLEF 2021
收藏hdl.handle.net2025-03-24 收录
下载链接:
http://hdl.handle.net/20.500.12800/1-4773
下载链接
链接失效反馈官方服务:
资源简介:
The dataset with 409,679 images belonging to 772 snake species from 188 countries and all continents (386,006 images with labels targeted for development and 23,673 images without labels for testing). In addition, we provide a simple train/val (90% / 10%) split to validate preliminary results while ensuring the same species distributions. Furthermore, we prepared a compact subset (70,208 images) for fast prototyping. The test set data consists of 23,673 images submitted to the iNaturalist platform within the "first four months of 2021.
All data were gathered from online biodiversity platforms (i.e., iNaturalist, HerpMapper) and further extended by data scraped from Flickr. The provided dataset has a heavy long-tailed class distribution, where the most frequent species (Thamnophis sirtalis) is represented by 22,163 images and the least frequent by just 10 (Achalinus formosanus).
本数据集包含来自188个国家及所有大洲的772种蛇类的409,679张图像,其中针对开发目的的图像带有标签,共计386,006张,而用于测试的图像则无标签,共有23,673张。此外,我们还提供了一个简单的训练/验证集划分(90%训练,10%验证),以确保物种分布的一致性,并用于初步结果的验证。同时,我们还准备了一个紧凑的子集(70,208张图像),以便于快速原型设计。测试集数据由2021年前四个月内在iNaturalist平台提交的23,673张图像组成。所有数据均从在线生物多样性平台(例如iNaturalist、HerpMapper)收集,并通过从Flickr爬取的数据进一步扩展。该数据集呈现出明显的长尾分布特征,其中最常见的物种(Thamnophis sirtalis)由22,163张图像表示,而最少见的物种(Achalinus formosanus)仅有10张图像。
提供机构:
hdl.handle.net



