Resource Introduction:
Current research on grape picking-point detection relies mainly on grape detection and segmentation in visible RGB images, and the accuracy of detection and segmentation directly determines the recognition results and localization accuracy of picking points. However, background saliency differs significantly among grape varieties (green vs. purple), and in natural scenarios their recognition rate and segmentation accuracy are usually rather poor because of leaf occlusion, overlapping fruits, complex backgrounds, and ever-changing illumination. In addition, the morphological characteristics of grapes differ greatly from those of other fruits such as apples and pears, so existing datasets struggle to meet the research requirements for detecting and segmenting bunch-shaped grape fruits. Constructing multimodal grape image datasets based on visible, depth, and infrared images is therefore crucial for achieving higher recognition rates and stronger generalization in grape detection and semantic segmentation models.

This dataset offers high-quality multimodal image data of two grape varieties, green (White Rose) and purple (Giant Peak), under different lighting and shading conditions. It contains 300 images of green grapes and 500 images of purple grapes, with a total of 3,938 annotated targets and a size of approximately 2.97 GB. The dataset can be augmented by rotation, scaling, shearing, translation, and Gaussian blur for training mainstream deep learning models (see the example sketch below). It provides valuable basic image data for multimodal image data fusion, grape semantic segmentation, and object detection, and has important practical application value for advancing research on intelligent agricultural machinery and equipment.
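As a hedged illustration of the augmentation step mentioned above, the following minimal Python sketch applies rotation, scaling, shearing, translation, and Gaussian blur to a single RGB image using torchvision. It is not the dataset authors' own pipeline, and the file paths (`green/grape_001.jpg`, `grape_001_aug.jpg`) are hypothetical placeholders rather than the dataset's actual layout.

```python
# Minimal augmentation sketch (not the dataset authors' pipeline).
# Assumes an ordinary RGB image file; the paths below are hypothetical.
from PIL import Image
from torchvision import transforms

augment = transforms.Compose([
    transforms.RandomAffine(
        degrees=15,            # random rotation within +/-15 degrees
        translate=(0.1, 0.1),  # random translation up to 10% of width/height
        scale=(0.8, 1.2),      # random scaling (zoom in/out)
        shear=10,              # random shearing within +/-10 degrees
    ),
    transforms.GaussianBlur(kernel_size=5, sigma=(0.1, 2.0)),  # Gaussian blur
])

img = Image.open("green/grape_001.jpg").convert("RGB")  # hypothetical input path
augmented = augment(img)                                # returns an augmented PIL image
augmented.save("grape_001_aug.jpg")                     # hypothetical output path
```

Note that when augmenting paired modalities (visible, depth, and infrared views of the same bunch), the same random geometric parameters should be applied to all modalities so that the images stay spatially aligned; the single-image sketch above does not handle that case.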