葡萄多模态目标检测和语义分割数据集

Mendeley Data2024-01-31 更新2024-06-27 收录

下载链接：

https://www.scidb.cn/en/detail?dataSetId=84fa458dfc854fba8ce578b6d826b9c8

下载链接

链接失效反馈

官方服务：

资源简介：

The accuracy of grape picking point localization is dependent on grape detection and semantic segmentation network performance. However, in practical application scenarios, the accuracy and segmentation precision of grape targets based on visible light images are susceptible to light variations and complex environments, often performing poorly. Moreover, grapes grow in bunches, and the existing multimodal datasets for apples and pears can hardly meet the recognition needs of bunch-shaped grapes. The construction of visible, depth, and near-infrared multimodal object detection and semantic segmentation datasets of grapes is crucial to exploring better recognition rates and stronger generalization capabilities for grape detection and semantic segmentation models. This dataset, totaling about 39.08 GB, contains high-quality multimodal video stream data of green and purple grapes, including six varieties, under different illumination and obscuration conditions. Additionally, the dataset offers 3954 labeled image samples extracted from the aforementioned multimodal video. By means of rotation, deflation, mis-slicing, panning, and Gaussian blur, the dataset can be augmented for the training implementation of mainstream deep learning models. The dataset can provide valuable basic data resources for multimodal fusion, grape semantic segmentation, and object detection, which have important practical application value for promoting research in the field of agricultural machinery and equipment intelligence.

葡萄采摘点定位的精度，取决于葡萄检测与语义分割（semantic segmentation）网络的性能表现。然而在实际应用场景中，基于可见光图像的葡萄目标检测（object detection）精度与分割精度易受光照变化与复杂环境影响，往往表现不佳。此外，葡萄呈串状生长，现有针对苹果、梨的多模态（multimodal）数据集难以满足串状葡萄的识别需求。构建可见光、深度与近红外（near-infrared）多模态的葡萄目标检测及语义分割数据集，对于提升葡萄检测与语义分割模型的识别精度、增强其泛化能力至关重要。本数据集总容量约39.08 GB，涵盖不同光照与遮挡条件下，6个品种的绿色与紫色葡萄的高质量多模态视频流数据。此外，本数据集还包含从上述多模态视频中提取的3954张标注图像样本。本数据集可通过旋转、缩放、错切、平移及高斯模糊等方式进行数据增强，以适配主流深度学习模型的训练需求。本数据集可为多模态融合、葡萄语义分割及目标检测任务提供宝贵的基础数据资源，对推动农业智能装备领域的研究具有重要的实际应用价值。

创建时间：

2024-01-31

搜集汇总

数据集介绍

背景与挑战

背景概述

该数据集是一个专注于葡萄多模态目标检测和语义分割的数据资源，总大小约39.08 GB，包含可见光、深度和近红外多模态视频流数据，涵盖绿色和紫色葡萄的六个品种在不同光照和遮挡条件下的采集样本，并提供了3954个标注图像样本。该数据集旨在支持深度学习模型训练，通过数据增强提升葡萄检测和分割的准确性与泛化能力，对农业机械设备智能化研究具有重要应用价值。

以上内容由遇见数据集搜集并总结生成