SUN09场景理解数据集

极市2022-02-15 更新2024-03-04 收录

下载链接：

https://www.cvmart.net/dataSets/detail/308

下载链接

链接失效反馈

官方服务：

资源简介：

AbstractWe introduce a new dataset (SUN 09) suitable for leveraging the contextual information. The dataset contains 12.000 annotated images covering a large number of scene categories (indoor and outdoors) with more than 200 object categories and 152.000 annotated object instances.Exploiting Hierarchical Context on a Large Database of Object CategoriesPresented at 2010 IEEE CVPRMyung Jin ChoiJoseph J. LimAntonio TorralbaAlan S. WillskyMassachusetts Institute of TechnologyThe SUN 09 DatasetBenchmark 1. For evaluation of a general object recognition system.Download all images (.jpg) and annotations (.xml) in the SUN 09 dataset :sun09.tar(5.2GB)static_sun09_database: 12,000 annotated imagesstatic_sun_objects: additional images to train baseline detectors (not used to train the context model)out_of_context: 42 out-of-context imagesBenchmark 2. For evaluation of a context model with precomputed baseline detector outputs.One important source of variation among context-based models is the performance difference of the baseline detectors. Here we provide a precomputed set of detector outputs on a fixed train/test split so that context models can be evaluated using the same baseline detectors. We use the baseline detectors by[Felzenszwalb et al.].Download baseline detector outputs in text files (.txt):detectorOutputsText.tar.gzThe file names correspond to [(test/train)/objectCategory/imageName.txt]Each line in the text file shows the bounding box locations and scores for one candidate window: [x1 y1 x2 y2 score]We use 4,367 training images and 4,317 test images. Each set has the same number of images per scene category.Download baseline detector outputs in MATLAB files (.mat):datasetMat.tar(159MB)Load sun09_detectorOutputs.mat for baseline detector outputs and sun09_groundTruth.mat for ground-truth annotations.AcknowledgmentsThe authors would like to thank Taeg Sang Cho for helpful discussions and feedback. This research was partially funded by Shell International Exploration and Production Inc., by Army Research Office under award W911NF-06-1-0076, by NSF Career Award (ISI 0747120), and by the Air Force Office of Scientific Research under Award No.FA9550-06-1-0324. Any opinions, findings, and conclusions or recommendations expressed in this publication are those of the author(s) and do not necessarily reflect the views of the Air Force.

摘要：我们提出了一款可用于上下文信息挖掘的全新数据集（SUN 09）。该数据集包含12000张标注图像，涵盖丰富的室内外场景类别，覆盖200余种物体类别，且标注了152000个物体实例。《基于大规模物体类别数据库的层级上下文挖掘》，发表于2010年IEEE计算机视觉与模式识别会议（IEEE CVPR）作者：Myung Jin Choi、Joseph J. Lim、Antonio Torralba、Alan S. Willsky 麻省理工学院（Massachusetts Institute of Technology, MIT） SUN 09 数据集基准测试1：面向通用物体识别系统的性能评估。下载SUN 09数据集的全部图像（.jpg格式）与标注文件（.xml格式）：sun09.tar（5.2GB） - static_sun09_database：12000张标注图像 - static_sun_objects：用于训练基准检测器（baseline detector）的额外图像（不用于训练上下文模型（context model）） - out_of_context：42张脱离上下文的图像基准测试2：基于预计算基准检测器输出的上下文模型性能评估。基于上下文的模型之间的核心差异来源之一，是基准检测器的性能表现差异。为此，我们提供了固定训练/测试划分下的预计算检测器输出集，使不同上下文模型可基于同一套基准检测器开展公平评估。本次评估采用Felzenszwalb等人提出的基准检测器。下载文本格式的基准检测器输出文件（.txt格式）：detectorOutputsText.tar.gz 文件命名规则为：[(测试集/训练集)/物体类别/图像名称.txt] 文本文件的每一行对应一个候选检测窗口的边界框（bounding box）位置与置信度分数，格式为：[x1 y1 x2 y2 score] 本次划分包含4367张训练图像与4317张测试图像，两类集合中每个场景类别的图像数量保持一致。下载MATLAB格式的基准检测器输出文件（.mat格式）：datasetMat.tar（159MB）可通过sun09_detectorOutputs.mat加载基准检测器输出数据，通过sun09_groundTruth.mat加载真实标注（ground-truth annotation）数据。致谢作者感谢Taeg Sang Cho提供的有益讨论与反馈。本研究部分由壳牌国际勘探与生产公司、陆军研究办公室（项目编号W911NF-06-1-0076）、美国国家科学基金会职业发展奖（项目编号ISI 0747120）以及空军科学研究办公室（项目编号FA9550-06-1-0324）资助。本文表达的任何观点、发现、结论或建议仅代表作者本人，未必反映空军的官方立场。

提供机构：

极市

搜集汇总

数据集介绍

背景与挑战

背景概述

SUN09场景理解数据集是一个由MIT发布的大规模数据集，包含12,000张标注图像，涵盖室内外多种场景，涉及超过200个对象类别和152,000个对象实例，主要用于对象识别和上下文模型评估。数据集提供两个基准测试：一个用于一般对象识别系统评估，另一个用于上下文模型评估，并包含预计算的检测器输出。文件大小为7.79GB，适用于目标检测任务。

以上内容由遇见数据集搜集并总结生成