Marcel031/dior

Name: Marcel031/dior
Creator: Marcel031
Published: 2026-04-10 14:27:24
License: 暂无描述

Hugging Face2026-04-10 更新2026-04-12 收录

下载链接：

https://hf-mirror.com/datasets/Marcel031/dior

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: image_id dtype: int64 - name: image dtype: image - name: width dtype: int64 - name: height dtype: int64 - name: objects sequence: - name: bbox_id dtype: int64 - name: category dtype: class_label: names: '0': Airplane '1': Airport '2': Baseball field '3': Basketball court '4': Bridge '5': Chimney '6': Dam '7': Expressway service area '8': Expressway toll station '9': Golf course '10': Ground track field '11': Harbor '12': Overpass '13': Ship '14': Stadium '15': Storage tank '16': Tennis court '17': Train station '18': Vehicle '19': Wind mill - name: bbox sequence: int64 length: 4 - name: area dtype: int64 splits: - name: train num_bytes: 5902685454 num_examples: 18000 - name: test num_bytes: 1150035824 num_examples: 3463 - name: validation num_bytes: 645393741 num_examples: 2000 download_size: 7626168863 dataset_size: 7698115019 configs: - config_name: default data_files: - split: train path: data/train-* - split: test path: data/test-* - split: validation path: data/validation-* task_categories: - object-detection language: - en pretty_name: DIOR --- # DIOR Hugging Face-Ready Vision Dataset This dataset is a restructured version of the DIOR (Object Detection in Optical Remote Sensing Images), specifically designed to simplify object detection workflows. By converting them to the COCO format, this project provides an easier way to use DIOR with popular computer vision frameworks. Additionally, the dataset is formatted for seamless integration with Hugging Face datasets, unlocking new possibilities for training and experimentation. ## 📂 Dataset Structure ### COCO Format The dataset follows the COCO dataset structure, making it straightforward to work with: ```plaintext dior/ ├── annotations/ │ ├── instances_train.json │ ├── instances_val.json │ └── instances_test.json ├── train/ ├── val/ ├── test/ ``` ### Hugging Face Format The dataset is compatible with the datasets library. You can load it directly using: ```python from datasets import load_dataset dataset = load_dataset("HichTala/dior") ``` ## 🖼️ Sample Visualizations Above: An example of resized images with bounding boxes in COCO format. ## 🚀 Getting Started ### Install Required Libraries - Install datasets for Hugging Face compatibility: ```bash pip install datasets ``` - Use any object detection framework supporting COCO format for training. ### Load the Dataset #### Hugging Face: ```python from datasets import load_dataset dataset = load_dataset("HichTala/dior") train_data = dataset["train"] ``` #### Custom Script for COCO-Compatible Frameworks: ```python import json from pycocotools.coco import COCO coco = COCO("annotations/train.json") ``` see demo notebook [here](https://github.com/cocodataset/cocoapi/blob/master/PythonAPI/pycocoDemo.ipynb) for more details. ## 📚 Used in Research This processed version of DIOR has been used in the paper:\ 📄 [LoRA for Cross-Domain Few-Shot Object Detection](https://huggingface.co/papers/2504.06330)\ The dataset served as a target domain for evaluating the generalization capabilities of diffusion-based object detectors in low-data regimes. ## 📝 How to Cite If you use this dataset, please consider citing the original DIOR dataset: ```plaintext @article{Li_2020, title={Object detection in optical remote sensing images: A survey and a new benchmark}, volume={159}, ISSN={0924-2716}, url={http://dx.doi.org/10.1016/j.isprsjprs.2019.11.023}, DOI={10.1016/j.isprsjprs.2019.11.023}, journal={ISPRS Journal of Photogrammetry and Remote Sensing}, publisher={Elsevier BV}, author={Li, Ke and Wan, Gang and Cheng, Gong and Meng, Liqiu and Han, Junwei}, year={2020}, month=jan, pages={296–307}} ``` Additionally, you can mention this repository for the resized COCO and Hugging Face formats. Enjoy using DIOR in coco format for your object detection experiments! 🚀

提供机构：

Marcel031

搜集汇总

数据集介绍

构建方式

DIOR数据集是针对光学遥感图像中目标检测任务精心构建的大规模基准数据集。其构建过程遵循严格的图像采集与标注流程，从Google Earth等遥感图像源中搜集涵盖多样地理场景的影像，并基于20个具有代表性的目标类别（如飞机、港口、体育场等）进行人工边框标注。本版本对原始数据集进行了系统重构，将标注信息转换为COCO格式，并以Hugging Face兼容的格式重新组织，便于直接加载与使用。数据集划分了训练集（18000张图像）、测试集（3463张图像）和验证集（2000张图像），各子集均包含图像、目标类别及其边界框等结构化信息。

特点

DIOR数据集的核心特点在于其丰富多样、覆盖广泛的遥感场景类别，每张图像均包含精确的目标检测标注，且类别平衡性良好。数据集按COCO格式组织，搭配完整的JSON标注文件，包括实例分割与检测所需的目标ID、类别名称及边界框坐标等元数据。此外，该数据集已无缝集成至Hugging Face生态，用户可通过简单的API调用直接加载数据。其规模庞大、标注精细，能够有效支撑跨域少样本学习、遥感图像分析及通用目标检测研究，成为评估模型泛化能力与遥感场景理解的理想基准。

使用方法

使用DIOR数据集时，用户可借助Hugging Face的datasets库直接加载，仅需调用`load_dataset("HichTala/dior")`即可获取划分好的训练、验证与测试数据。对于基于COCO格式的深度学习框架，如Detectron2或MMDetection，可直接读取JSON标注文件进行训练。建议安装pycocotools库以处理COCO标准注解，并结合可视化工具检查边界框标注质量。研究人员还可参考官方Colab示例或论文《LoRA for Cross-Domain Few-Shot Object Detection》中展示的用法，将DIOR作为目标域，评估模型在低数据场景下的遥感目标检测性能。

背景与挑战

背景概述

DIOR（Object Detection in Optical Remote Sensing Images）数据集诞生于2020年，由西北工业大学的李珂、万刚、程塨、孟丽秋和韩军伟等研究人员联合发布。该数据集专为光学遥感图像中的目标检测任务而设计，涵盖20个常见遥感目标类别，包括飞机、机场、桥梁、船舶等，共计23463张高分辨率遥感图像及192472个精细标注的目标实例。DIOR的提出有力地推动了遥感领域目标检测技术的标准化评估，成为该领域最具影响力的基准数据集之一，被广泛应用于算法验证与模型比较。

当前挑战

DIOR数据集所面临的挑战主要体现在两个方面。从领域问题来看，遥感图像中的目标检测需应对视角多样、尺度差异巨大、目标密集排列以及背景复杂多变等难题，例如小型车辆与大型港口物体共存于同一场景，传统检测方法难以统一处理。从构建过程来看，数据集的标注工作面临大规模遥感图像中微小目标的精确定位困难，以及跨传感器、多分辨率图像的一致性标注质量保障挑战。此外，类别间样本不平衡问题显著，如‘车辆’类样本数量远超‘风车’类，对公平评估构成了额外障碍。

常用场景

经典使用场景

DIOR数据集专为光学遥感图像中的目标检测任务而设计，其经典使用场景集中于遥感领域中多类别、多尺度的地物目标识别。该数据集包含20个常见遥感目标类别，如飞机、港口、立交桥、体育场等，图像源自Google Earth，具有场景多样、分辨率高、样本丰富等特点。研究者常利用DIOR训练和评估基于深度学习的通用目标检测器，例如Faster R-CNN、YOLO和SSD，以验证模型在复杂遥感背景下的检测性能。同时，DIOR也是少样本学习、域适应及跨场景泛化研究中的重要基准，为遥感智能解译提供了标准化的评测平台。

解决学术问题

DIOR数据集的提出有效解决了遥感图像目标检测领域缺乏大规模、类别均衡且标注精细的公开基准这一核心学术难题。在此之前，遥感目标检测数据集多集中于少数类别或特定场景，难以支撑复杂环境下的模型训练与公平比较。DIOR通过提供统一标注格式的万级规模图像，使研究者能够在类别丰富、场景多样的条件下系统评估检测算法的鲁棒性与泛化能力。其出现推动了遥感目标检测从传统手工特征向深度学习的范式跃迁，尤其在多尺度目标定位、小目标识别、背景干扰抑制等关键问题的研究上具有重要意义，显著提升了遥感智能解译的学术水平和实用价值。

衍生相关工作

DIOR数据集催生了大量衍生研究工作，推动了遥感目标检测领域的持续创新。在方法论层面，研究者提出了面向遥感图像的旋转目标检测方法（如RoI Transformer、S2ANet）以及多尺度特征融合框架（如Feature Pyramid Networks），以更好地适应DIOR中地物的方向多样性与尺度差异。在应用拓展方面，基于DIOR的少样本目标检测、域自适应检测、弱监督定位等技术相继涌现，例如LoRA方法在跨域少样本检测中利用DIOR作为目标域评估模型泛化性。此外，DIOR还促进了遥感目标检测竞赛与开源平台的构建，成为连接传统遥感数据与当代深度学习框架的重要桥梁。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集