camvid数据集

Name: camvid数据集
Creator: 阿里云天池
Published: 2026-05-26 17:24:17
License: 暂无描述

阿里云天池2026-05-26 更新2025-04-19 收录

下载链接：

https://tianchi.aliyun.com/dataset/201729

下载链接

链接失效反馈

官方服务：

资源简介：

基本信息全称：Cambridge-Driving Labeled Video Database（剑桥驾驶标注视频数据库）。创建者：由英国剑桥大学计算机实验室与英国高速铁路公司合作创建，由 Gabriel J. Brostow 等人在 2008 年发布。数据来源：数据是从驾驶汽车的角度拍摄的，通过一辆装备有摄像头和 GPS 设备的汽车在剑桥市的城市街道上采集，涵盖各种环境、天气和道路状况。数据规模：包含 701 个视频序列，每个序列大约为 10 秒，共 701 张分辨率为 960×720 的图像以及和每个图像对应的标签。数据划分训练集：包含 367 对图像和标签，用于模型的训练过程，让模型学习如何识别和分类不同的图像元素。验证集：有 101 对图像和标签，在模型训练过程中用于实时评估模型性能，可据此调整模型参数和超参数，以及提前终止训练过程。测试集：共 233 对图像和标签，用于评估训练好的模型的性能，由于是模型未见过的数据，能客观地反映模型的泛化能力。标注信息类别数量：提供了 32 个 ground truth 语义标签，将每个像素与语义类别之一相关联，不过在通常的分割精度评估中，会使用 11 种常用类别，加上背景类别，共 12 类。标注形式：标签以颜色编码的形式呈现，每个颜色对应一个类别，方便可视化地观察每个类别，使得像素之间的关系更易于理解和解读。特点精细化标注：每个像素都被严格分类，能够确保模型学会更细致地 “看” 图像，精确地识别不同物体的边界和区域。实用性强：涵盖的类别丰富，包含道路、交通标志、汽车、天空、行人道、电线杆、围墙、行人、建筑物、自行车和树木等，满足多种现实世界需求，尤其适合自动驾驶、智能交通系统等领域的研究。社区支持活跃：作为一个开源项目，吸引了大量开发者和研究人员，形成了活跃的社区，大家可以共享经验与进展，推动相关研究和应用的发展。易于获取：可直接在项目仓库获取数据集，降低了进入门槛，鼓励更多人投身于视觉智能的研发之中。应用领域自动驾驶：帮助训练模型以准确区分道路上的不同对象，如行驶车道与人行道，识别复杂的街头标志等，提升自动驾驶的安全性和准确性。智能交通系统：为交通标志、行人、车辆等元素的识别提供支持，有助于提升交通管理的效率和准确性。图像分割研究：其丰富的标注和多样的场景可帮助研究人员更好地验证和优化图像分割算法，推动该领域的发展。其他领域：还适用于智能城市规划、无人机监测及任何依赖于高精度图像理解的应用场景。

Basic Information Full Name: Cambridge-Driving Labeled Video Database. Developer: Developed in collaboration between the Computer Laboratory of the University of Cambridge (UK) and a UK high-speed rail company, the dataset was published by Gabriel J. Brostow et al. in 2008. Data Source: Footage was captured from the perspective of a moving vehicle. Data was collected via a car equipped with cameras and GPS devices driving on urban streets in Cambridge, covering diverse environments, weather conditions, and road scenarios. Dataset Scale: The dataset contains 701 video sequences, each approximately 10 seconds in duration, along with a total of 701 images at a resolution of 960×720 and their corresponding annotation labels. Data Split Training Set: Comprising 367 pairs of images and their corresponding labels, this subset is used for model training, enabling the model to learn to recognize and classify distinct image elements. Validation Set: Consisting of 101 pairs of images and labels, this subset is utilized to evaluate model performance in real time during training, allowing for adjustment of model parameters and hyperparameters, as well as early termination of the training process. Test Set: With a total of 233 pairs of images and labels, this subset is used to evaluate the performance of the trained model. Since this data is unseen by the model, it can objectively reflect the model's generalization ability. Annotation Information Category Count: The dataset provides 32 ground truth semantic labels, which assign each pixel to one of the predefined semantic categories. However, in standard segmentation accuracy evaluation protocols, 11 commonly used categories plus the background class are adopted, resulting in a total of 12 categories. Annotation Format: Labels are presented in a color-coded manner, where each color corresponds to a specific category. This facilitates visual inspection of each category and makes the inter-pixel relationships easier to comprehend and interpret. Features Fine-grained Annotation: Every pixel is strictly categorized, ensuring that the model learns to perceive images with finer granularity and accurately identifies the boundaries and regions of different objects. High Practicality: It covers a comprehensive set of categories including roads, traffic signs, automobiles, sky, sidewalks, utility poles, fences, pedestrians, buildings, bicycles, trees, etc., meeting a wide range of real-world requirements, and is particularly well-suited for research in autonomous driving and intelligent transportation systems. Active Community Support: As an open-source project, it has attracted a large number of developers and researchers, fostering an active community where members can share experiences and research progress, thus promoting the advancement of related research and applications. Easy Access: The dataset is freely available directly from the project repository, lowering the barrier to entry and encouraging more individuals to engage in visual intelligence research and development. Application Fields Autonomous Driving: It aids in training models to accurately distinguish between different objects on roads, such as driving lanes and sidewalks, and recognize complex street signs, thereby enhancing the safety and accuracy of autonomous driving systems. Intelligent Transportation Systems: It provides support for the recognition of elements including traffic signs, pedestrians, vehicles, etc., helping to improve the efficiency and accuracy of traffic management. Image Segmentation Research: Its rich annotations and diverse scenarios enable researchers to better verify and optimize image segmentation algorithms, thus advancing the development of this field. Other Fields: It is also applicable to smart city planning, drone monitoring, and any application scenarios that rely on high-precision image understanding.

提供机构：

阿里云天池

创建时间：

2025-04-16

搜集汇总

数据集介绍

背景与挑战

背景概述

camvid数据集是一个用于自动驾驶和图像分割研究的公开数据集，由剑桥大学创建，包含701张驾驶视角的城市街道图像及像素级语义标签，划分为训练、验证和测试集。其特点在于提供精细化标注，涵盖道路、车辆、行人等32个类别，并以颜色编码呈现，适用于训练模型以提升视觉理解能力，尤其在智能交通和自动驾驶领域有广泛应用。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集