Wild Blueberry Yield Prediction Dataset
Alibaba Cloud Tianchi, updated 2026-06-09, indexed 2025-10-11
Download link:
https://tianchi.aliyun.com/dataset/211780
Resource description:
---
Topic:
- Agriculture
- Plant
Field:
- Prediction
- Data Mining
- Regression
Ext:
- .csv
DatasetUsage:
- 2834940
---
## Background Description
Blueberries are globally popular, captivating consumers with their unique flavor and rich nutritional value. They rely on suitable climates and soils for growth, so they are mainly distributed in regions such as North America, Europe, and Australia.
The cultivation of wild blueberries is currently in a booming stage, attracting more and more investors and farmers to engage in this field. The global demand for healthy foods is continuously growing, and wild blueberries have gained widespread attention for their natural nutritional value and abundant antioxidants. However, cultivating wild blueberries faces a series of challenges, including unstable climates, disease threats, and fluctuations in market prices. Therefore, successful wild blueberry cultivation requires continuous innovation and sustainable agricultural practices to meet the increasing global market demand.
Blueberries are perennial flowering plants with blue or purple berries. They are classified in the section *Cyanococcus* within the genus *Vaccinium*. Other species in *Vaccinium* include cranberries, bilberries, huckleberries, and Madeira blueberries. Commercial blueberries—both wild (lowbush) and cultivated (highbush)—are native to North America. Highbush varieties were introduced to Europe in the 1930s.
Blueberries are generally creeping shrubs, ranging in height from 10 cm (4 in) to 4 m (13 ft). In commercial blueberry production, varieties that grow on low bushes and bear small, pea-sized berries are called "lowbush blueberries" (synonymous with "wild"), while varieties that grow on taller, cultivated bushes and bear larger berries are called "highbush blueberries". Canada is the leading producer of lowbush blueberries, while the United States produces approximately 40% of the global supply of highbush blueberries.

## Data Description
| Field | Description |
|-|-|
| Clonesize* | Average size of blueberry clones, unit: $m^2$ |
| Honeybee | Density of honeybees, unit: bees/$m^2$/minute |
| Bumbles | Density of bumblebees, unit: bumblebees/$m^2$/minute |
| Andrena | Density of *Andrena* bees, unit: *Andrena* bees/$m^2$/minute |
| Osmia | Density of *Osmia* bees, unit: *Osmia* bees/$m^2$/minute |
| MaxOfUpperTRange | Maximum recorded daily average temperature in the upper temperature range during flowering period, unit: ${^{\circ}C}$ |
| MinOfUpperTRange | Minimum recorded daily average temperature in the upper temperature range during flowering period, unit: ${^{\circ}C}$ |
| AverageOfUpperTRange | Daily average temperature in the upper temperature range during flowering period, unit: ${^{\circ}C}$ |
| MaxOfLowerTRange | Maximum recorded daily average temperature in the lower temperature range during flowering period, unit: ${^{\circ}C}$ |
| MinOfLowerTRange | Minimum recorded daily average temperature in the lower temperature range during flowering period, unit: ${^{\circ}C}$ |
| AverageOfLowerTRange | Daily average temperature in the lower temperature range during flowering period, unit: ${^{\circ}C}$ |
| RainingDays | Total number of days with rainfall greater than 0 during the flowering period, unit: days |
| AverageRainingDays | Average number of rainy days during the flowering period, unit: days |
| fruitset | Fruit set |
| fruitmass | Fruit mass |
| seeds | Number of seeds |

Note:
Clonesize refers to the average ground area occupied by each blueberry clone.
A blueberry clone is a blueberry plant produced by asexual propagation. There are two main ways to propagate and cultivate blueberries:
1. Seed propagation: seeds are extracted from blueberry fruits and sown to raise seedlings. Plants grown this way show substantial genetic variation.
2. Clonal propagation: elite mother plants are selected and propagated, for example by tissue culture, into clonal plants with highly consistent genetic characteristics. Orchards established this way have plants whose traits and yields tend to be uniform.
Thus, blueberry clones are plants raised through asexual propagation, and the entire orchard is planted with clones of the same blueberry variety.
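
As a rough sketch of how the fields described above might be read and checked, the snippet below parses a CSV with Python's standard `csv` module and verifies that every listed column is present. The column list is copied from the field table; the real file's header may differ in letter case or contain extra columns, so names are matched case-insensitively. `load_rows`, `EXPECTED_COLUMNS`, and `SAMPLE_CSV` are hypothetical names, and the two sample rows are made-up placeholder values, not records from the dataset.

```python
import csv
import io

# Column names taken from the field table; treat them as an assumption
# about the real CSV header, which may use different letter case.
EXPECTED_COLUMNS = [
    "Clonesize", "Honeybee", "Bumbles", "Andrena", "Osmia",
    "MaxOfUpperTRange", "MinOfUpperTRange", "AverageOfUpperTRange",
    "MaxOfLowerTRange", "MinOfLowerTRange", "AverageOfLowerTRange",
    "RainingDays", "AverageRainingDays", "fruitset", "fruitmass", "seeds",
]


def load_rows(handle):
    """Read CSV rows into dicts of floats, matching column names case-insensitively."""
    reader = csv.DictReader(handle)
    header = {name.strip().lower(): name for name in (reader.fieldnames or [])}
    missing = [c for c in EXPECTED_COLUMNS if c.lower() not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return [
        {c: float(record[header[c.lower()]]) for c in EXPECTED_COLUMNS}
        for record in reader
    ]


# Made-up placeholder rows for illustration only (NOT real dataset values).
SAMPLE_CSV = (
    "clonesize,honeybee,bumbles,andrena,osmia,"
    "MaxOfUpperTRange,MinOfUpperTRange,AverageOfUpperTRange,"
    "MaxOfLowerTRange,MinOfLowerTRange,AverageOfLowerTRange,"
    "RainingDays,AverageRainingDays,fruitset,fruitmass,seeds\n"
    "37.5,0.75,0.25,0.25,0.25,30.0,15.0,22.0,17.0,5.0,12.0,16.0,0.26,0.41,0.42,32.5\n"
    "25.0,0.5,0.25,0.38,0.5,25.0,12.0,18.0,15.0,4.0,10.0,1.0,0.1,0.52,0.46,36.8\n"
)

rows = load_rows(io.StringIO(SAMPLE_CSV))
print(len(rows), "rows loaded; first clone size:", rows[0]["Clonesize"])
```

For the downloaded file, the same function could be called as `load_rows(open(path, newline=""))`, where `path` is wherever the CSV was saved.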
## Data Source
Qu, Hongchun; Obsie, Efrem; Drummond, Frank (2020), "Data for: Wild blueberry yield prediction using a combination of computer simulation and machine learning algorithms", Mendeley Data, V1, doi: 10.17632/p5hvjzsvn8.1
## Problem Description
* Analysis of the relationship between blueberry clone size and other factors: Statistical analysis and data visualization can be used to explore the relationship between the average blueberry clone size (Clonesize) and other factors.
* Analysis of the relationship between temperature and blueberry growth: The daily average temperature statistics for the upper temperature range (MaxOfUpperTRange, MinOfUpperTRange, AverageOfUpperTRange) and the lower temperature range (MaxOfLowerTRange, MinOfLowerTRange, AverageOfLowerTRange) can be used to analyze their correlations with blueberry fruit set (fruitset), fruit mass (fruitmass), and number of seeds (seeds).
* Analysis of the impact of rainfall on blueberry growth: Rainfall data (RainingDays, AverageRainingDays) can be used to investigate whether rainfall affects blueberry growth and production.
* Machine learning prediction models: Predict target variables such as blueberry clone size, fruit set, fruit mass, or number of seeds.
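
The analysis directions above could start from something like the following minimal sketch: a Pearson correlation between each candidate factor and a target, followed by a one-feature least-squares line as a prediction baseline. It deliberately uses only the Python standard library so it runs without extra dependencies; pandas or scikit-learn would be the more usual choice for real work. The helper names (`pearson`, `fit_line`, `predict`) are hypothetical, and the `records` list holds made-up illustrative values, not data from the dataset, so the printed numbers say nothing about real blueberry fields.

```python
from math import sqrt

# Made-up illustrative records (NOT values from the dataset).
records = [
    {"RainingDays": 1.0, "AverageOfUpperTRange": 18.0, "fruitset": 0.58},
    {"RainingDays": 4.0, "AverageOfUpperTRange": 20.0, "fruitset": 0.55},
    {"RainingDays": 16.0, "AverageOfUpperTRange": 22.0, "fruitset": 0.47},
    {"RainingDays": 24.0, "AverageOfUpperTRange": 24.0, "fruitset": 0.41},
    {"RainingDays": 34.0, "AverageOfUpperTRange": 26.0, "fruitset": 0.36},
]


def column(name):
    """Extract one field from every record as a list."""
    return [r[name] for r in records]


def mean(xs):
    return sum(xs) / len(xs)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, my - slope * mx


# Correlation of each candidate factor with the target (fruitset).
target = column("fruitset")
corr = {
    name: pearson(column(name), target)
    for name in ("RainingDays", "AverageOfUpperTRange")
}

# One-feature baseline model: fruitset as a linear function of RainingDays.
slope, intercept = fit_line(column("RainingDays"), target)


def predict(raining_days):
    return slope * raining_days + intercept


for name, value in corr.items():
    print(f"corr({name}, fruitset) = {value:.3f}")
print(f"predicted fruitset at 10 rainy days: {predict(10.0):.3f}")
```

A single-feature line is only a baseline; with the full table, a multivariate regressor evaluated on a held-out split would be the natural next step.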
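Provider:
Alibaba Cloud Tianchi
Created:
2025-10-08
Collected summary
Dataset introduction

Background and challenges
Background overview
This dataset focuses on wild blueberry yield prediction and contains multi-dimensional variables such as clone size, bee density, temperature, and rainfall, making it suitable for agricultural data analysis and machine learning modeling. It provides detailed field descriptions and research directions, supporting applications that range from statistical analysis to building predictive models.
The above content was collected and summarized by 遇见数据集.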



