Construction of a Dataset for Knowledge Atlas of Cotton Diseases and Pests
DataCite Commons. Updated: 2025-04-27. Indexed: 2025-05-18.
Download link:
https://www.scidb.cn/detail?dataSetId=d54df230cd06414b9b316dcb00944e07
Resource description:
Cotton is one of China's important cash crops and one of the world's most important textile raw materials. During cotton cultivation, various diseases and pests significantly reduce yield. Building a knowledge graph of cotton diseases and pests, which intelligently processes their names, symptoms, and prevention and control methods and stores them in graph form, is of great significance for precise and rapid prevention and control. This dataset was constructed to support such a knowledge graph. Unstructured data were collected from books and websites on cotton disease and pest control using optical character recognition (OCR) and Python web crawling. After cleaning and merging, data on 30 common cotton diseases and 49 cotton pests were obtained. The dataset can be used to construct a knowledge graph of cotton diseases and pests, providing data support for the informatization and intelligent development of China's cotton planting industry.
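The description above names a clean-and-merge step followed by storage in graph form. The sketch below shows one way that could look. It is not the authors' pipeline: the field names (`name`, `category`, `symptoms`, `prevention`), the relation labels (`is_a`, `has_symptom`, `prevented_by`), and the sample records are all hypothetical placeholders, not values taken from the dataset.

```python
def normalize(text):
    """Collapse whitespace runs, a common artifact of OCR and web scraping."""
    return " ".join(text.split())


def clean_and_merge(raw_records):
    """Normalize fields and merge records sharing a name (case-insensitive) and category."""
    merged = {}
    for raw in raw_records:
        name = normalize(raw["name"])
        key = (name.lower(), raw["category"])
        # The first spelling seen becomes the canonical name for the merged record.
        rec = merged.setdefault(
            key,
            {"name": name, "category": raw["category"], "symptoms": [], "prevention": []},
        )
        for field in ("symptoms", "prevention"):
            value = normalize(raw.get(field, ""))
            if value and value not in rec[field]:
                rec[field].append(value)
    return list(merged.values())


def to_triples(records):
    """Turn merged records into (subject, relation, object) triples."""
    triples = []
    for rec in records:
        triples.append((rec["name"], "is_a", rec["category"]))
        for symptom in rec["symptoms"]:
            triples.append((rec["name"], "has_symptom", symptom))
        for method in rec["prevention"]:
            triples.append((rec["name"], "prevented_by", method))
    return triples


# Hypothetical placeholder records, standing in for OCR and crawler output.
raw = [
    {"name": "Example  disease A", "category": "disease",
     "symptoms": "placeholder symptom 1", "prevention": "placeholder method 1"},
    {"name": "example disease a", "category": "disease",
     "symptoms": "placeholder   symptom 2", "prevention": "placeholder method 1"},
    {"name": "Example pest B", "category": "pest",
     "symptoms": "placeholder symptom 3", "prevention": ""},
]

records = clean_and_merge(raw)
triples = to_triples(records)
for triple in triples:
    print(triple)
```

Triples of this shape can be loaded into most graph stores. Matching names case-insensitively is only one possible merge rule, and real OCR output would likely need fuzzier matching.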
Provider:
Science Data Bank
Created:
2023-09-26
Collected summary
Dataset introduction

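Background and challenges
Background overview
This dataset builds a knowledge graph of cotton diseases and pests, containing data on 30 common cotton diseases and 49 cotton pests. The data were collected from relevant books and websites through OCR and Python crawling, then cleaned and merged, and provide data support for the informatization and intelligent development of the cotton planting industry.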
The content above was collected and summarized by 遇见数据集.



