Francesco/smoke-uvylj

Name: Francesco/smoke-uvylj
Creator: Francesco
Published: 2023-03-30 09:32:38
License: 暂无描述

Hugging Face2023-03-30 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/Francesco/smoke-uvylj

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: image_id dtype: int64 - name: image dtype: image - name: width dtype: int32 - name: height dtype: int32 - name: objects sequence: - name: id dtype: int64 - name: area dtype: int64 - name: bbox sequence: float32 length: 4 - name: category dtype: class_label: names: '0': smoke-0 '1': smoke annotations_creators: - crowdsourced language_creators: - found language: - en license: - cc multilinguality: - monolingual size_categories: - 1K<n<10K source_datasets: - original task_categories: - object-detection task_ids: [] pretty_name: smoke-uvylj tags: - rf100 --- # Dataset Card for smoke-uvylj ** The original COCO dataset is stored at `dataset.tar.gz`** ## Dataset Description - **Homepage:** https://universe.roboflow.com/object-detection/smoke-uvylj - **Point of Contact:** francesco.zuppichini@gmail.com ### Dataset Summary smoke-uvylj ### Supported Tasks and Leaderboards - `object-detection`: The dataset can be used to train a model for Object Detection. ### Languages English ## Dataset Structure ### Data Instances A data point comprises an image and its object annotations. ``` { 'image_id': 15, 'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=640x640 at 0x2373B065C18>, 'width': 964043, 'height': 640, 'objects': { 'id': [114, 115, 116, 117], 'area': [3796, 1596, 152768, 81002], 'bbox': [ [302.0, 109.0, 73.0, 52.0], [810.0, 100.0, 57.0, 28.0], [160.0, 31.0, 248.0, 616.0], [741.0, 68.0, 202.0, 401.0] ], 'category': [4, 4, 0, 0] } } ``` ### Data Fields - `image`: the image id - `image`: `PIL.Image.Image` object containing the image. Note that when accessing the image column: `dataset[0]["image"]` the image file is automatically decoded. Decoding of a large number of image files might take a significant amount of time. Thus it is important to first query the sample index before the `"image"` column, *i.e.* `dataset[0]["image"]` should **always** be preferred over `dataset["image"][0]` - `width`: the image width - `height`: the image height - `objects`: a dictionary containing bounding box metadata for the objects present on the image - `id`: the annotation id - `area`: the area of the bounding box - `bbox`: the object's bounding box (in the [coco](https://albumentations.ai/docs/getting_started/bounding_boxes_augmentation/#coco) format) - `category`: the object's category. #### Who are the annotators? Annotators are Roboflow users ## Additional Information ### Licensing Information See original homepage https://universe.roboflow.com/object-detection/smoke-uvylj ### Citation Information ``` @misc{ smoke-uvylj, title = { smoke uvylj Dataset }, type = { Open Source Dataset }, author = { Roboflow 100 }, howpublished = { \url{ https://universe.roboflow.com/object-detection/smoke-uvylj } }, url = { https://universe.roboflow.com/object-detection/smoke-uvylj }, journal = { Roboflow Universe }, publisher = { Roboflow }, year = { 2022 }, month = { nov }, note = { visited on 2023-03-29 }, }" ``` ### Contributions Thanks to [@mariosasko](https://github.com/mariosasko) for adding this dataset.

--- dataset_info: 特征: - 名称: image_id 数据类型: int64 - 名称: image 数据类型: 图像 - 名称: width 数据类型: int32 - 名称: height 数据类型: int32 - 名称: objects 序列类型: - 名称: id 数据类型: int64 - 名称: area 数据类型: int64 - 名称: bbox 序列类型: float32 长度: 4 - 名称: category 数据类型: 类别标签: 名称映射: '0': smoke-0 '1': smoke 标注创建者: - 众包语言创建者: - 采集自公开数据源语言: - 英语许可协议: - CC 多语言属性: - 单语言样本量范围: - 1000 < 样本数 < 10000 源数据集: - 原生数据集任务类别: - 目标检测任务子项: [] 数据集简称: smoke-uvylj 标签: - RF100 --- # smoke-uvylj 数据集卡片 **原始COCO格式数据集存储于`dataset.tar.gz`** ## 数据集说明 - **主页**: https://universe.roboflow.com/object-detection/smoke-uvylj - **联系方式**: francesco.zuppichini@gmail.com ### 数据集概述 smoke-uvylj ### 支持任务与排行榜 - `目标检测`: 该数据集可用于训练目标检测模型。 ### 语言英语 ## 数据集结构 ### 数据实例每个数据样本包含一幅图像及其目标标注信息。 { 'image_id': 15, 'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=640x640 at 0x2373B065C18>, 'width': 964043, 'height': 640, 'objects': { 'id': [114, 115, 116, 117], 'area': [3796, 1596, 152768, 81002], 'bbox': [ [302.0, 109.0, 73.0, 52.0], [810.0, 100.0, 57.0, 28.0], [160.0, 31.0, 248.0, 616.0], [741.0, 68.0, 202.0, 401.0] ], 'category': [4, 4, 0, 0] } } ### 数据字段说明 - `image_id`: 图像唯一标识符 - `image`: 包含图像的`PIL.Image.Image`对象。请注意，当访问图像列时：`dataset[0]["image"]`会自动对图像文件进行解码。解码大量图像文件可能会耗费大量时间，因此建议优先通过样本索引再指定图像列进行查询，即**始终优先使用`dataset[0]["image"]`而非`dataset["image"][0]`** - `width`: 图像宽度 - `height`: 图像高度 - `objects`: 包含图像中目标的边界框元数据的字典 - `id`: 标注编号 - `area`: 边界框的面积 - `bbox`: 目标的边界框（采用[COCO](https://albumentations.ai/docs/getting_started/bounding_boxes_augmentation/#coco)格式） - `category`: 目标所属类别。 #### 标注人员说明标注人员为Roboflow平台用户。 ## 附加信息 ### 许可信息详见原主页: https://universe.roboflow.com/object-detection/smoke-uvylj ### 引用信息 @misc{ smoke-uvylj, title = { smoke uvylj 数据集 }, type = { 开源数据集 }, author = { Roboflow 100 }, howpublished = { url{ https://universe.roboflow.com/object-detection/smoke-uvylj } }, url = { https://universe.roboflow.com/object-detection/smoke-uvylj }, journal = { Roboflow Universe }, publisher = { Roboflow }, year = { 2022 }, month = { 11月 }, note = { 访问于 2023-03-29 }, } ### 贡献致谢感谢[@mariosasko](https://github.com/mariosasko)为本数据集的收录提供支持。

提供机构：

Francesco

原始信息汇总

数据集概述

数据集名称

名称: smoke-uvylj

数据集特征

特征:
- image_id: 整数类型
- image: 图像类型
- width: 整数类型
- height: 整数类型
- objects: 序列类型，包含以下子特征:
  - id: 整数类型
  - area: 整数类型
  - bbox: 序列类型，长度为4，浮点数类型
  - category: 类别标签，包含类别名称

数据集结构

数据实例:
- 每个数据点包含一张图片及其对象标注信息。
- 示例结构:
  
  { image_id: 15, image: <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=640x640 at 0x2373B065C18>, width: 964043, height: 640, objects: { id: [114, 115, 116, 117], area: [3796, 1596, 152768, 81002], bbox: [ [302.0, 109.0, 73.0, 52.0], [810.0, 100.0, 57.0, 28.0], [160.0, 31.0, 248.0, 616.0], [741.0, 68.0, 202.0, 401.0] ], category: [4, 4, 0, 0] } }

数据字段

image: 图片对象
width: 图片宽度
height: 图片高度
objects: 包含对象信息的字典
- id: 标注ID
- area: 边界框面积
- bbox: 对象边界框
- category: 对象类别

数据集用途

任务: 对象检测

数据集详情

语言: 英语
许可证: cc
多语言性: 单语种
大小类别: 1K<n<10K
来源: 原始数据集
标签: rf100

搜集汇总

数据集介绍

构建方式

在计算机视觉领域，针对特定目标的检测任务需要高质量标注数据集的支持。smoke-uvylj数据集通过众包标注方式构建，其原始图像来源于公开资源，并由Roboflow平台的用户进行精细标注。数据采集过程遵循系统化流程，确保每张图像均包含规范的边界框与类别标签，最终形成适用于目标检测模型训练的结构化数据集合。

特点

该数据集专注于烟雾检测场景，涵盖两类目标类别，规模介于一千至一万样本之间，属于中等体量的专项数据集。其标注格式兼容通用目标检测框架，每个样本均提供图像标识符、尺寸信息及包含边界框坐标、面积与类别的物体标注字典。数据结构清晰，便于直接加载至主流深度学习平台进行端到端处理。

使用方法

研究者可利用该数据集直接训练烟雾检测模型，通过调用标准数据加载接口访问图像与标注信息。数据字段设计兼顾效率与兼容性，建议采用索引优先访问策略以优化图像解码性能。该数据集适用于评估目标检测算法在特定场景下的泛化能力，亦可用于迁移学习或数据增强策略的验证实验。

背景与挑战

背景概述

在计算机视觉领域，目标检测技术对于环境监测与公共安全具有关键意义。smoke-uvylj数据集由Roboflow机构于2022年创建，作为Roboflow 100（RF100）基准数据集的一部分，专注于烟雾检测这一特定任务。该数据集通过众包方式收集与标注，旨在为烟雾识别模型提供高质量的视觉数据支持，以应对火灾预警、环境监控等实际应用场景。其构建遵循COCO数据格式标准，促进了目标检测算法的统一评估与迭代优化，对提升智能监控系统的感知能力具有显著推动作用。

当前挑战

烟雾检测任务面临多重挑战：烟雾形态多变、透明度高且易受光照、天气等环境因素干扰，导致目标特征提取困难，模型易产生误检或漏检。在数据集构建过程中，众包标注虽提升了数据规模，但标注一致性难以保证，不同标注者对烟雾边界的判定可能存在主观差异。此外，数据集中类别不平衡问题突出，正负样本分布不均，可能影响模型训练的泛化性能。这些挑战共同制约了烟雾检测算法在实际复杂场景中的鲁棒性与准确性。

常用场景

经典使用场景

在计算机视觉领域，目标检测任务常需高质量标注数据以训练模型。smoke-uvylj数据集作为Roboflow 100系列的一部分，其经典使用场景聚焦于烟雾检测模型的开发与评估。该数据集通过众包方式标注了图像中的烟雾目标，提供了边界框及类别信息，为研究者构建和优化基于深度学习的烟雾识别算法提供了标准化的基准数据。

衍生相关工作

围绕smoke-uvylj数据集，衍生出多项经典研究工作。许多学者利用该数据集对现有目标检测架构进行适应性改进，提出了针对烟雾特征的轻量化模型，以平衡精度与推理速度。此外，结合数据增强与迁移学习技术，研究者开发了跨场景烟雾检测方案，增强了模型在多变环境下的鲁棒性。这些工作不仅丰富了烟雾检测领域的文献，也为Roboflow生态中其他类似数据集的构建与应用提供了参考范式。

数据集最近研究