cansa/Describable-Textures-Dataset-DTD

Name: cansa/Describable-Textures-Dataset-DTD
Creator: cansa
Published: 2023-05-11 07:36:27
License: 暂无描述

Hugging Face2023-05-11 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/cansa/Describable-Textures-Dataset-DTD

下载链接

链接失效反馈

官方服务：

资源简介：

Describable Textures Dataset (DTD) 是一个不断发展的纹理图像集合，包含了人类感知属性的标注。该数据集包含5640张图像，分为47个类别，每个类别有120张图像。图像大小在300x300到640x640之间，且至少90%的图像表面代表了类别属性。数据集分为训练集、验证集和测试集，每个类别在每个集合中有40张图像。数据集的图像来自Google和Flickr，并通过Amazon Mechanical Turk进行标注。

提供机构：

cansa

原始信息汇总

Describable Textures Dataset (DTD) 概述

数据集描述

名称: Describable Textures Dataset (DTD)
内容: 包含5640张图像，分为47个类别，每个类别有120张图像。
图像特征: 图像大小在300x300至640x640像素之间，至少90%的表面代表类别属性。
来源: 图像通过在Google和Flickr上使用特定属性搜索获得。
标注: 每张图像提供主要类别（key attribute）和相关联的属性列表（joint attributes）。

数据集结构

分割: 数据集分为训练、验证和测试三部分，每部分包含每个类别的40张图像。
提供内容: 包含图像、基础真值标注（key和joint attributes）以及用于评估的10个数据分割。

数据集用途

目的: 旨在通过机器模拟人类对图像纹理的描述能力，深入理解纹理信息的处理、分析和表示。
应用: 用于计算机视觉领域的研究，特别是纹理分析和材料识别。

数据集下载

数据集文件:
- dtd-r1.0.1.tar.gz: 包含图像和基础真值标注。
- dtd-r1.0.1-labels.tar.gz: 包含标注和数据分割。
- dtd-r1-decaf_feats.tar.gz: 包含图像的DeCAF特征。

致谢

支持: 该研究得到多个机构和项目的支持，包括NSF、ODNI、JHU-HLTCOE和Google Research。
开发背景: 数据集的开发始于2012年Johns Hopkins大学的CLSP Summer Workshop。

搜集汇总

数据集介绍

构建方式

Describable Textures Dataset (DTD) is meticulously curated by collecting images from the internet, primarily using Google and Flickr as sources. Each image is selected to represent a specific texture, with a total of 5640 images spanning 47 distinct texture categories. To ensure that the images are representative, they are annotated with key attributes and a list of joint attributes that capture the essence of the texture. This annotation process involves multiple iterations using Amazon Mechanical Turk, ensuring the annotations are both accurate and comprehensive. The dataset is then split into three equal parts for training, validation, and testing, with 120 images per category in each split.

特点

The salient feature of the DTD dataset lies in its human-centric annotations and the diversity of textures it encompasses. With 47 categories, each containing 120 images, the dataset is rich in variety, offering a broad spectrum of textural patterns. The images are of high resolution, ranging from 300x300 to 640x640 pixels, and each image is carefully selected to ensure that at least 90% of its surface represents the category attribute. The annotations provided for each image include both a key attribute (main category) and joint attributes, offering a nuanced understanding of the texture. This dataset is particularly useful for research in texture analysis, material recognition, and the development of machine learning models that can describe and classify textures in a manner similar to human perception.

使用方法

Utilizing the DTD dataset involves downloading the package which contains the dataset images, along with the ground truth annotations and splits used for evaluation. The dataset is structured into three subsets for training, validation, and testing, with each subset containing 40 images per class. Researchers can leverage the annotations provided, which include key attributes and joint attributes, to train and evaluate machine learning models. The dataset also comes with pre-computed DeCAF features for each image, which can be used for feature extraction and model training. The README file provides detailed information on the dataset structure and the annotation process, guiding researchers on how to effectively use the dataset for their specific research needs.

背景与挑战

背景概述

纹理识别是计算机视觉领域中的一个重要研究方向，它不仅要求系统能够识别图像中的物体，还要能够描述物体的纹理特征。 describable Textures Dataset (DTD) 正是为了这一目的而创建的。该数据集由牛津大学视觉几何组的研究人员于2014年提出，旨在通过分析纹理的感知属性，帮助机器理解和描述纹理。DTD 数据集包含了5640张图像，按照47个与人类感知相关的类别进行组织。每个类别有120张图像，图像大小在300x300到640x640之间，至少有90%的表面代表类别属性。这些图像是通过在Google和Flickr上搜索提出的属性和相关术语收集的，并通过Amazon Mechanical Turk进行标注。DTD 数据集的发布为纹理分析和描述提供了宝贵的数据资源，对相关领域的研究产生了深远的影响。

当前挑战

尽管 DTD 数据集为纹理识别研究提供了丰富的数据资源，但仍然面临着一些挑战。首先，纹理识别的领域问题在于如何让机器理解和描述纹理的复杂性和多样性。DTD 数据集中的纹理具有丰富的感知属性，如颜色、形状、纹理和空间分布等，这些属性之间的相互作用使得纹理识别变得非常复杂。其次，在构建过程中，研究人员需要确保图像的标注质量和一致性，这对于训练有效的纹理识别模型至关重要。此外，由于纹理识别任务的复杂性，如何设计和评估纹理识别模型也是一个重要的挑战。

常用场景

经典使用场景

在计算机视觉领域中，描述性纹理数据集（DTD）广泛应用于纹理分析和图像理解的研究。其核心价值在于模拟人类视觉系统对纹理的感知和描述能力，通过对图像中的纹理属性进行标注，使得机器能够理解和描述纹理的细节特征。DTD数据集包含5640张图像，分为47个类别，每个类别包含120张图像，这些图像均从Google和Flickr上收集，并经过人工标注，确保了数据的质量和准确性。该数据集适用于纹理分类、纹理分割、纹理合成等多种研究任务，为纹理分析领域提供了宝贵的数据资源。

实际应用

DTD数据集在实际应用中具有重要的价值。在工业生产中，DTD数据集可以用于产品质量检测，通过对产品表面的纹理进行分析，判断产品的质量是否符合标准。在医学领域，DTD数据集可以用于病变检测，通过对病变组织的纹理进行分析，辅助医生进行诊断。此外，DTD数据集还可以用于图像检索、内容识别等领域，为这些领域的研究和应用提供了重要的数据支持。

衍生相关工作

DTD数据集的构建和发布，激发了纹理分析领域的研究热潮。许多研究者基于DTD数据集，开展了大量的研究工作，取得了丰硕的成果。例如，有研究者利用DTD数据集，提出了新的纹理描述方法，提高了纹理分类的准确率；有研究者利用DTD数据集，开发了新的纹理分割算法，提高了纹理分割的精度。这些研究成果，不仅推动了纹理分析技术的发展，也为DTD数据集的进一步应用提供了新的思路。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集