ELIE – Entomological Label Information Extraction
Source: DataCite Commons | Updated: 2025-07-07 | Indexed: 2026-05-04
Download link:
https://zenodo.org/records/15730765
Description:
ELIE (Entomological Label Information Extraction) is a semi-automated pipeline designed to extract structured metadata from digitized insect specimen labels using deep learning and OCR. This overarching dataset compiles and standardizes annotated labels from seven major digitization projects, supporting research in biodiversity informatics and AI. The pipeline uses CNNs, Tesseract, Google Vision OCR, and clustering, achieving up to 91.6% accuracy on printed labels. ELIE accelerates museum digitization and enhances access to legacy biodiversity data.
Provider:
Anonymous
Created:
2025-06-24



