智能识别车辆抛洒垃圾/杂物算法模型的图像训练数据
收藏浙江省数据知识产权登记平台2025-11-19 更新2025-11-26 收录
下载链接:
https://www.zjip.org.cn/home/announce/trends/8402555
下载链接
链接失效反馈官方服务:
资源简介:
本数据集主要用于提升AI模型对车辆抛洒垃圾、杂物等行为的识别能力与精确性。通过对该数据集的训练,使AI模型能够精准识别固体垃圾抛洒、液体泄漏、建筑废料抛洒等抛洒行为,并可应用于城市道路管理、环境卫生维护、交通违法治理及智慧城市平台等场景。同时,本数据集可为城市管理部门提供智能化执法依据,实现抛洒行为的自动识别与取证;为环卫部门提供精准作业指导,提升道路清洁效率;为交通管理部门降低巡查成本;为智慧城市平台提供多部门协同治理的数据支持,从而全面提升城市管理水平和环境质量。
1.数据采集
通过企业自有摄像设备自行采集道路车辆抛洒垃圾杂物图像,同步记录图像ID、采集时间、设备型号、地理坐标、光照条件、天气状况等数据。
2.数据预处理与标注
通过数据清洗剔除模糊、重复图像。按7:2:1比例划分训练集/验证集/测试集。设置多级标注体系:
一级标签:抛洒/未抛洒
二级标签:固体垃圾抛洒/液体泄漏/建筑废料抛洒
辅助标注:抛洒物边界框坐标、车辆边界框坐标
3.模型选择与初始化
采用YOLOv5m作为基础架构,初始化参数并优化超参数:学习率设置为0.01-0.001动态调整,批量大小1-32动态调整,锚框参数根据常见抛洒物尺寸定制,并集成注意力机制提升小目标检测能力。
4.模型训练
基于PyTorch框架实施两阶段训练,采用混合精度训练(FP16)提升效率。设置训练时长,数据增强模块模拟各类抛洒场景,包括:添加运动模糊、生成不同抛洒轨迹(抛物线/自由落体)、模拟雨雪天气干扰。设置早停机制(patience=10)和梯度裁剪(max_norm=1.2)。
5.模型评估
在训练模型的过程中,使用验证集调整超参数,训练完成后在测试集上评估模型表现,评估指标包含:
基础性能:mAP@0.5、误报率
场景鲁棒性测试:雨雾场景检出率
This dataset is primarily developed to improve the recognition performance and accuracy of AI models for identifying behaviors like vehicle-borne garbage dumping, debris scattering, and similar acts. Training AI models on this dataset enables them to accurately recognize dumping behaviors including solid waste dumping, liquid leakage, and construction waste scattering. It can be applied in scenarios such as urban road management, environmental sanitation maintenance, traffic violation governance, and smart city platforms. Meanwhile, this dataset can provide intelligent law enforcement basis for urban management departments, enabling automatic identification and evidence collection of dumping behaviors; offer precise operation guidance for sanitation departments to improve road cleaning efficiency; reduce patrol costs for traffic management departments; and provide data support for multi-department collaborative governance for smart city platforms, thereby comprehensively enhancing urban management levels and environmental quality.
1. Data Collection
Road images of vehicles dumping garbage and debris are collected independently using the enterprise's own camera equipment, with simultaneous recording of data such as image ID, collection time, device model, geographic coordinates, lighting conditions, and weather conditions.
2. Data Preprocessing and Annotation
Blur and duplicate images are removed via data cleaning. The dataset is split into training, validation, and test sets at a ratio of 7:2:1. A multi-level annotation system is established:
Primary labels: Dumping / Non-dumping
Secondary labels: Solid waste dumping / Liquid leakage / Construction waste scattering
Auxiliary annotations: Bounding box coordinates of dumped objects, bounding box coordinates of vehicles
3. Model Selection and Initialization
YOLOv5m is adopted as the basic architecture, with initialization of parameters and optimization of hyperparameters: the learning rate is dynamically adjusted between 0.01 and 0.001, the batch size is dynamically adjusted between 1 and 32, the anchor box parameters are customized based on the sizes of common dumped objects, and an attention mechanism is integrated to improve the detection capability for small targets.
4. Model Training
Two-stage training is implemented based on the PyTorch framework, with mixed-precision training (FP16) adopted to improve efficiency. Training duration is set, and the data augmentation module simulates various dumping scenarios, including: adding motion blur, generating different dumping trajectories (parabola/free fall), and simulating rain and snow weather interference. An early stopping mechanism (patience=10) and gradient clipping (max_norm=1.2) are set.
5. Model Evaluation
During the model training process, the validation set is used to adjust hyperparameters. After training is completed, the model performance is evaluated on the test set. The evaluation metrics include:
Basic performance: mAP@0.5, false positive rate
Scene robustness test: Detection rate in fog and rain scenarios
提供机构:
杭州声贝软件技术有限公司
创建时间:
2025-08-03
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集包含580条图像训练数据,每日更新,专用于训练AI模型以精准识别车辆抛洒垃圾、液体泄漏等行为,提升城市管理和环境维护效率。数据集基于YOLOv5m算法架构,通过多级标注和动态训练优化,支持道路管理、交通违法治理等智慧城市应用场景。
以上内容由遇见数据集搜集并总结生成



