pascal-voc

Name: pascal-voc
Creator: maas
Published: 2026-05-16 19:53:19
License: 暂无描述

魔搭社区2026-05-16 更新2025-07-19 收录

下载链接：

https://modelscope.cn/datasets/merve/pascal-voc

下载链接

链接失效反馈

官方服务：

资源简介：

## Pascal VOC #### Dataset Summary The Pascal Visual Object Classes (VOC) dataset is a widely used benchmark in the field of computer vision. It is designed for object detection, image classification, semantic segmentation, and action classification tasks. The dataset provides a comprehensive set of annotated images covering 20 object classes, allowing researchers to evaluate and compare the performance of various algorithms. **Note**: This dataset repository contains all editions of PASCAL-VOC, each file is identified with the year. #### Dataset Structure **Images:** The dataset contains 178k images. **Annotations:** Annotations include object bounding boxes, object class labels, segmentation masks, and action labels. **Classes:** 20 object classes: person, bicycle, car, motorbike, aeroplane, bus, train, boat, bird, cat, dog, horse, sheep, cow, elephant, bear, zebra, giraffe, and potted plant. **Supported Tasks** **Image Classification:** Assigning a label to an image from a fixed set of categories. **Object Detection:** Identifying objects within an image and drawing bounding boxes around them. **Semantic Segmentation:** Assigning a class label to each pixel in the image. **Action Classification:** Identifying the action being performed in the image. #### Applications The Pascal VOC dataset is used for: - Benchmarking and evaluating computer vision algorithms. - Training models for image classification, object detection, and segmentation tasks. #### Data Collection and Annotation **Data Sources** The images were collected from Flickr and other sources, ensuring a diverse and representative sample of real-world scenes. A**nnotation Process** Annotations were carried out by a team of human annotators. Each image is labeled with: - Bounding boxes for object detection. - Class labels for each object. - Pixel-wise segmentation masks for semantic segmentation. - Action labels indicating the action performed by the objects in the image. #### License The Pascal VOC dataset is released under the Creative Commons Attribution 2.5 License. Users are free to share, adapt, and use the dataset, provided appropriate credit is given. #### Citation If you use the Pascal VOC dataset in your research, please cite the following paper: ``` @article{Everingham10, author = {Mark Everingham and Luc Gool and Christopher K. I. Williams and John Winn and Andrew Zisserman}, title = {The Pascal Visual Object Classes (VOC) Challenge}, journal = {International Journal of Computer Vision}, volume = {88}, number = {2}, year = {2010}, pages = {303-338}, }

## 帕斯卡视觉对象类（Pascal Visual Object Classes, VOC）数据集 #### 数据集概述帕斯卡视觉对象类（VOC）数据集是计算机视觉领域广泛使用的基准测试集，专为目标检测、图像分类、语义分割以及动作分类任务设计。该数据集提供覆盖20个目标类别的全套标注图像，可供研究人员评估并对比各类算法的性能表现。 **注意**：本数据集仓库包含PASCAL-VOC的所有版本，每个文件均以年份标识。 #### 数据集结构 **图像**：本数据集共包含17.8万张图像。 **标注信息**：标注内容涵盖目标边界框、目标类别标签、分割掩码以及动作标签。 **类别**：共20个目标类别：人、自行车、汽车、摩托车、飞机、公共汽车、火车、船只、鸟类、猫、狗、马、绵羊、奶牛、大象、熊、斑马、长颈鹿以及盆栽植物。 **支持任务** **图像分类**：从固定类别集合中为图像分配对应标签。 **目标检测**：识别图像中的目标并为其绘制边界框。 **语义分割**：为图像中的每个像素分配类别标签。 **动作分类**：识别图像中目标所执行的动作。 #### 应用场景帕斯卡视觉对象类数据集可用于以下场景： - 对计算机视觉算法进行基准测试与性能评估 - 训练适用于图像分类、目标检测以及分割任务的模型 #### 数据采集与标注 **数据来源** 图像采集自Flickr及其他渠道，确保样本覆盖多样且具备真实场景的代表性。 **标注流程** 标注工作由人工标注团队完成，每张图像的标注内容包括： - 用于目标检测的边界框 - 每个目标的类别标签 - 用于语义分割的逐像素分割掩码 - 标注图像内目标所执行动作的标签 #### 许可协议帕斯卡视觉对象类数据集采用知识共享署名2.5许可（Creative Commons Attribution 2.5 License）协议发布。用户可自由共享、改编并使用该数据集，但需提供恰当的署名引用。 #### 引用说明若您在研究中使用帕斯卡视觉对象类数据集，请引用以下论文： @article{Everingham10, author = {Mark Everingham and Luc Gool and Christopher K. I. Williams and John Winn and Andrew Zisserman}, title = {The Pascal Visual Object Classes (VOC) Challenge}, journal = {International Journal of Computer Vision}, volume = {88}, number = {2}, year = {2010}, pages = {303-338}, }

提供机构：

maas

创建时间：

2025-07-18

搜集汇总

数据集介绍