five

Insect Detect - insect classification dataset v2

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8325383
下载链接
链接失效反馈
官方服务:
资源简介:
The Insect Detect - insect classification dataset v2 contains mainly images of various insects sitting on or flying above an artificial flower platform. All images were automatically recorded with the Insect Detect DIY camera trap, a hardware combination of the Luxonis OAK-1, Raspberry Pi Zero 2 W and PiJuice Zero pHAT for automated insect monitoring (bioRxiv preprint). Most of the images were captured by camera traps deployed at different sites in 2023. For some classes (e.g. ant, bee_bombus, beetle_cocci, bug, bug_grapho, hfly_eristal, hfly_myathr, hfly_syrphus) additional images were captured with a lab setup of the camera trap. For some classes (e.g. bee_apis, fly, hfly_episyr, wasp) images from the first dataset version were transferred to this dataset. This dataset is also available on Roboflow Universe. The images in the dataset from Roboflow are automatically compressed, which decreases model accuracy when used for training. Therefore it is recommended to use this uncompressed Zenodo version and split the dataset into train/val/test subsets in the provided training notebook.   Classes This dataset contains the following 27 classes: ant (Formicidae) bee (Anthophila excluding Apis mellifera and Bombus sp.) bee_apis (Apis mellifera) bee_bombus (Bombus sp.) beetle (Coleoptera excluding Coccinellidae and some Oedemeridae) beetle_cocci (Coccinellidae) beetle_oedem (visually distinct Oedemeridae) bug (Heteroptera excluding Graphosoma italicum) bug_grapho (Graphosoma italicum) fly (Brachycera excluding Empididae, Sarcophagidae, Syrphidae and small Brachycera) fly_empi (Empididae) fly_sarco (visually distinct Sarcophagidae) fly_small (small Brachycera) hfly_episyr (hoverfly Episyrphus balteatus) hfly_eristal (hoverfly Eristalis sp., mainly Eristalis tenax) hfly_eupeo (mainly hoverfly Eupeodes corollae and Scaeva pyrastri) hfly_myathr (hoverfly Myathropa florea) hfly_sphaero (hoverfly Sphaerophoria sp., mainly Sphaerophoria scripta) hfly_syrphus (mainly hoverfly Syrphus sp.) lepi (Lepidoptera) none_bg (images with no insect - background (platform)) none_bird (images with no insect - bird sitting on platform) none_dirt (images with no insect - leaves and other plant material, bird droppings) none_shadow (images with no insect - shadows of insects or surrounding plants) other (other Arthropods, including various Hymenoptera and Symphyta, Diptera, Orthoptera, Auchenorrhyncha, Neuroptera, Araneae) scorpionfly (Panorpa sp.) wasp (mainly Vespula sp. and Polistes dominula) For the classes hfly_eupeo and hfly_syrphus a precise taxonomic distinction is not possible with images only, due to a potentially high variability in the appearance of the respective species. While most specimens will show the visual features that are important for a classification into one of these classes, some specimens of Syrphus sp. might look more like Eupeodes sp. and vice versa. The images were sorted to the respective class by considering taxonomic and visual distinctions. However, this dataset is still rather small regarding the visually extremely diverse Insecta. Insects that are not included in this dataset can therefore be classified to the wrong class. All results should always be manually validated and false classifications can be used to extend this basic dataset and retrain your custom classification model.   Deployment You can use this dataset as starting point to train your own insect classification models with the provided Google Colab training notebook. Read the model training instructions for more information. A insect classification model trained on this dataset is available in the insect-detect-ml GitHub repo. To deploy the model on your PC (ONNX format for fast CPU inference), follow the provided step-by-step instructions.   License This dataset is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0).
创建时间:
2023-12-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作