five

A medicinal plant leaf image dataset for plant health condition detection and classification

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/89rfgtxbdc
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains a curated collection of medicinal plant leaf images developed for research in plant disease detection, computer vision, and machine learning applications. The images were collected from three locations in Dhaka, Bangladesh—Rajbari, Ashulia, and Mirpur—between January 7, 2026 and February 27, 2026 using smartphone cameras (OnePlus Nord CE 4 Lite and iPhone 16 Pro Max). During data collection, detached leaves were placed on a uniform background to improve visibility of leaf morphology and disease symptoms. The dataset includes images from five medicinal plant species: Aloe Vera, Azadirachta Indica (Neem), Hibiscus Rosa Sinensis, Kalanchoe Pinnata, and Piper Betle, covering 16 leaf condition classes such as healthy, chlorotic, diseased, dried, and different growth stages. In total, the dataset contains 1,323 original images captured during field collection and 14,677 augmented images, resulting in 16,000 images. The original images were captured in high resolution (3072 x 4096 pixels, 4096 x 3072 pixels and 3024 x 4032 pixels), and all processed images were standardized to 512 × 512 pixels, converted to RGB color format, and stored in JPG format to ensure compatibility with machine learning and deep learning models. During preprocessing, background removal techniques were applied to isolate the leaf region and reduce irrelevant visual noise, while pixel values were normalized to maintain consistent image quality. Data augmentation techniques, including rotation, horizontal and vertical flipping, brightness and contrast adjustment, Gaussian noise addition, and image sharpening were applied to increase dataset diversity and improve class balance. The dataset is organized into two main directories: Original Images, which contain the raw captured leaf images, and Processed Images, which include resized, normalized, and augmented samples generated from the original dataset. Additionally, a CSV metadata file is included that provides structured information such as plant species names, leaf condition labels, image counts, and data collection locations, enabling easier dataset management and supporting reproducible machine learning experiments.
创建时间:
2026-03-09
二维码
社区交流群
二维码
科研交流群
商业服务