yizhangdev/pascal-voc

Name: yizhangdev/pascal-voc
Creator: yizhangdev
Published: 2026-04-15 06:32:43
License: No description available

Hugging Face | Updated 2026-04-15 | Indexed 2026-04-26

Download link:

https://hf-mirror.com/datasets/yizhangdev/pascal-voc

Description:

---
license: apache-2.0
task_categories:
- object-detection
- image-segmentation
- image-classification
---

## Pascal VOC

#### Dataset Summary

The Pascal Visual Object Classes (VOC) dataset is a widely used computer vision benchmark. It is designed for object detection, image classification, semantic segmentation, and action classification. The dataset provides annotated images covering 20 object classes, so researchers can evaluate and compare algorithms on a common footing.

**Note**: This dataset repository contains all editions of PASCAL-VOC; each file is identified by its year.

#### Dataset Structure

**Images:** The dataset contains 178k images.

**Annotations:** Annotations include object bounding boxes, object class labels, segmentation masks, and action labels.

**Classes:** 20 object classes: person, bird, cat, cow, dog, horse, sheep, aeroplane, bicycle, boat, bus, car, motorbike, train, bottle, chair, dining table, potted plant, sofa, and tv/monitor.

**Supported Tasks**

- **Image Classification:** Assigning a label to an image from a fixed set of categories.
- **Object Detection:** Identifying objects within an image and drawing bounding boxes around them.
- **Semantic Segmentation:** Assigning a class label to each pixel in the image.
- **Action Classification:** Identifying the action being performed in the image.

#### Applications

The Pascal VOC dataset is used for:

- Benchmarking and evaluating computer vision algorithms.
- Training models for image classification, object detection, and segmentation tasks.

#### Data Collection and Annotation

**Data Sources**

The images were collected from Flickr and other sources, giving a diverse sample of real-world scenes.

**Annotation Process**

Annotations were carried out by a team of human annotators. Each image is labeled with:

- Bounding boxes for object detection.
- Class labels for each object.
- Pixel-wise segmentation masks for semantic segmentation.
- Action labels indicating the action performed by people in the image.

#### License

The Pascal VOC dataset is released under the Creative Commons Attribution 2.5 License. Users are free to share, adapt, and use the dataset, provided appropriate credit is given.

#### Citation

If you use the Pascal VOC dataset in your research, please cite the following paper:

```
@article{Everingham10,
  author  = {Mark Everingham and Luc Van Gool and Christopher K. I. Williams and John Winn and Andrew Zisserman},
  title   = {The Pascal Visual Object Classes (VOC) Challenge},
  journal = {International Journal of Computer Vision},
  volume  = {88},
  number  = {2},
  year    = {2010},
  pages   = {303-338},
}
```
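The bounding boxes and class labels described under Data Collection and Annotation are distributed as one XML file per image in the original VOC releases. The sketch below reads such a file using only the Python standard library. It assumes this repository keeps the standard VOC XML layout, which the card does not confirm; the sample XML and the helper name `parse_voc_annotation` are hypothetical and for illustration only.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in the standard VOC XML layout; not taken from this repository.
SAMPLE_XML = """<annotation>
  <filename>example.jpg</filename>
  <size><width>500</width><height>375</height><depth>3</depth></size>
  <object>
    <name>dog</name>
    <difficult>0</difficult>
    <bndbox><xmin>48</xmin><ymin>240</ymin><xmax>195</xmax><ymax>371</ymax></bndbox>
  </object>
  <object>
    <name>person</name>
    <difficult>0</difficult>
    <bndbox><xmin>8</xmin><ymin>12</ymin><xmax>352</xmax><ymax>370</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc_annotation(xml_text):
    """Return (filename, [(class_name, xmin, ymin, xmax, ymax), ...])."""
    root = ET.fromstring(xml_text)
    filename = root.findtext("filename")
    boxes = []
    # Only direct <object> children of <annotation> are read.
    for obj in root.findall("object"):
        name = obj.findtext("name")
        bndbox = obj.find("bndbox")
        # Coordinates are pixel positions; some files store them as floats.
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((name, *coords))
    return filename, boxes


filename, boxes = parse_voc_annotation(SAMPLE_XML)
for box in boxes:
    print(filename, box)
```

To read a file from disk instead, pass its contents as a string, for example `parse_voc_annotation(open(path, encoding="utf-8").read())`, where `path` points to an annotation file in whichever edition you downloaded.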

Provider:

yizhangdev