five

pyronear/openfire

收藏
Hugging Face2022-12-11 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/pyronear/openfire
下载链接
链接失效反馈
官方服务:
资源简介:
--- annotations_creators: - crowdsourced language_creators: - crowdsourced language: [] license: - apache-2.0 multilinguality: [] size_categories: - 1K<n<10K source_datasets: - original task_categories: - image-classification task_ids: [] pretty_name: Wildfire image classification dataset collected using images from web searches. --- # Dataset Card for OpenFire ## Table of Contents - [Table of Contents](#table-of-contents) - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Supported Tasks and Leaderboards](#supported-tasks-and-leaderboards) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Instances](#data-instances) - [Data Fields](#data-fields) - [Data Splits](#data-splits) - [Dataset Creation](#dataset-creation) - [Curation Rationale](#curation-rationale) - [Source Data](#source-data) - [Annotations](#annotations) - [Personal and Sensitive Information](#personal-and-sensitive-information) - [Considerations for Using the Data](#considerations-for-using-the-data) - [Social Impact of Dataset](#social-impact-of-dataset) - [Discussion of Biases](#discussion-of-biases) - [Other Known Limitations](#other-known-limitations) - [Additional Information](#additional-information) - [Dataset Curators](#dataset-curators) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) - [Contributions](#contributions) ## Dataset Description - **Homepage:** https://pyronear.org/pyro-vision/datasets.html#openfire - **Repository:** https://github.com/pyronear/pyro-vision - **Point of Contact:** Pyronear <https://pyronear.org/en/> ### Dataset Summary OpenFire is an image classification dataset for wildfire detection, collected from web searches. ### Supported Tasks and Leaderboards - `image-classification`: The dataset can be used to train a model for Image Classification. ### Languages English ## Dataset Structure ### Data Instances A data point comprises an image URL and its binary label. ``` { 'image_url': 'https://cdn-s-www.ledauphine.com/images/13C08274-6BA6-4577-B3A0-1E6C1B2A573C/FB1200/photo-1338240831.jpg', 'is_wildfire': true, } ``` ### Data Fields - `image_url`: the download URL of the image. - `is_wildfire`: a boolean value specifying whether there is an ongoing wildfire on the image. ### Data Splits The data is split into training and validation sets. The training set contains 7143 images and the validation set 792 images. ## Dataset Creation ### Curation Rationale The curators state that the current wildfire classification datasets typically contain close-up shots of wildfires, with limited variations of weather conditions, luminosity and backrgounds, making it difficult to assess for real world performance. They argue that the limitations of datasets have partially contributed to the failure of some algorithms in coping with sun flares, foggy / cloudy weather conditions and small scale. ### Source Data #### Initial Data Collection and Normalization OpenFire was collected using images publicly indexed by the search engine DuckDuckGo using multiple relevant queries. The images were then manually cleaned to remove errors. ### Annotations #### Annotation process Each web search query was designed to yield a single label (with wildfire or without), and additional human verification was used to remove errors. #### Who are the annotators? François-Guillaume Fernandez ### Personal and Sensitive Information [More Information Needed] ## Considerations for Using the Data ### Social Impact of Dataset [More Information Needed] ### Discussion of Biases [More Information Needed] ### Other Known Limitations [More Information Needed] ## Additional Information ### Dataset Curators François-Guillaume Fernandez ### Licensing Information [Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0). ### Citation Information ``` @software{Pyronear_PyroVision_2019, title={Pyrovision: wildfire early detection}, author={Pyronear contributors}, year={2019}, month={October}, publisher = {GitHub}, howpublished = {\url{https://github.com/pyronear/pyro-vision}} } ```
提供机构:
pyronear
原始信息汇总

数据集概述

数据集基本信息

  • 名称: OpenFire
  • 类型: 图像分类数据集
  • 目的: 用于野火检测
  • 数据来源: 网络搜索
  • 语言: 英语
  • 许可证: Apache-2.0
  • 数据量: 1K<n<10K
  • 多语言性: 不适用
  • 任务类别: 图像分类

数据集结构

  • 数据实例: 每个数据点包含一个图像URL和一个二元标签(是否存在野火)。
  • 数据字段:
    • image_url: 图像下载URL。
    • is_wildfire: 布尔值,指示图像中是否存在野火。
  • 数据分割: 数据分为训练集和验证集,训练集包含7143张图像,验证集包含792张图像。

数据集创建

  • 采集与规范化: 数据通过DuckDuckGo搜索引擎收集,并经过手动清理。
  • 标注过程: 每个网络搜索查询产生一个标签(有或无野火),并进行了人工验证以移除错误。
  • 标注者: François-Guillaume Fernandez

使用考虑

  • 数据集影响: 待补充
  • 偏见讨论: 待补充
  • 其他已知限制: 待补充

附加信息

  • 数据集管理员: François-Guillaume Fernandez
  • 许可证信息: Apache License 2.0
  • 引用信息: 见README文件中的引用格式
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作