COCO
Source: ModelScope community (updated 2025-12-05, indexed 2025-01-11)
Download link: https://modelscope.cn/datasets/HuggingFaceM4/COCO
# Dataset Card for COCO
## Table of Contents
- [Table of Contents](#table-of-contents)
- [Dataset Description](#dataset-description)
  - [Dataset Summary](#dataset-summary)
  - [Supported Tasks and Leaderboards](#supported-tasks-and-leaderboards)
  - [Languages](#languages)
- [Dataset Structure](#dataset-structure)
  - [Data Instances](#data-instances)
  - [Data Fields](#data-fields)
  - [Data Splits](#data-splits)
- [Dataset Creation](#dataset-creation)
  - [Curation Rationale](#curation-rationale)
  - [Source Data](#source-data)
  - [Annotations](#annotations)
  - [Personal and Sensitive Information](#personal-and-sensitive-information)
- [Considerations for Using the Data](#considerations-for-using-the-data)
  - [Social Impact of Dataset](#social-impact-of-dataset)
  - [Discussion of Biases](#discussion-of-biases)
  - [Other Known Limitations](#other-known-limitations)
- [Additional Information](#additional-information)
  - [Dataset Curators](#dataset-curators)
  - [Licensing Information](#licensing-information)
  - [Citation Information](#citation-information)
  - [Contributions](#contributions)
## Dataset Description
- **Homepage:** [https://cocodataset.org/](https://cocodataset.org/)
- **Repository:**
- **Paper:** [Microsoft COCO: Common Objects in Context](https://arxiv.org/abs/1405.0312)
- **Leaderboard:**
- **Point of Contact:**
### Dataset Summary
MS COCO is a large-scale object detection, segmentation, and captioning dataset.
COCO has several features: object segmentation, recognition in context, superpixel stuff segmentation, 330K images (more than 200K labeled), 1.5 million object instances, 80 object categories, 91 stuff categories, 5 captions per image, and 250,000 people with keypoints.
This repository currently contains only the 2014 subset (with Karpathy annotations and splits); contributions of the 2017 subset of COCO are welcome.
### Supported Tasks and Leaderboards
[More Information Needed]
### Languages
[More Information Needed]
## Dataset Structure
### Data Instances
Each instance has the following structure:
```
{
    'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=640x480 at 0x7F69C1BA8550>,
    'filepath': 'COCO_val2014_000000522418.jpg',
    'sentids': [681330, 686718, 688839, 693159, 693204],
    'filename': 'COCO_val2014_000000522418.jpg',
    'imgid': 1,
    'split': 'restval',
    'sentences': {
        'tokens': ['a', 'woman', 'wearing', 'a', 'net', 'on', 'her', 'head', 'cutting', 'a', 'cake'],
        'raw': 'A woman wearing a net on her head cutting a cake. ',
        'imgid': 1,
        'sentid': 681330
    },
    'cocoid': 522418
}
```
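The structure above can be read with plain dictionary access. The following is a minimal sketch, not part of the dataset's own tooling: the helper names `summarize_instance` and `training_split` are hypothetical, the `image` field is left out so the snippet runs without Pillow or a download, and folding `restval` into `train` is an assumption based on a common convention for Karpathy splits rather than something this card specifies.

```python
# Sample record copied from the instance shown above.
# The 'image' field (a PIL image) is omitted so the sketch needs no extra packages.
sample = {
    "filepath": "COCO_val2014_000000522418.jpg",
    "sentids": [681330, 686718, 688839, 693159, 693204],
    "filename": "COCO_val2014_000000522418.jpg",
    "imgid": 1,
    "split": "restval",
    "sentences": {
        "tokens": ["a", "woman", "wearing", "a", "net", "on", "her",
                   "head", "cutting", "a", "cake"],
        "raw": "A woman wearing a net on her head cutting a cake. ",
        "imgid": 1,
        "sentid": 681330,
    },
    "cocoid": 522418,
}


def summarize_instance(instance):
    """Hypothetical helper: collect the caption-related fields of one record."""
    sentence = instance["sentences"]
    return {
        "cocoid": instance["cocoid"],
        # The raw caption carries trailing whitespace in the example, so strip it.
        "caption": sentence["raw"].strip(),
        "num_tokens": len(sentence["tokens"]),
        "split": instance["split"],
    }


def training_split(split):
    """Assumption: treat 'restval' as extra training data; other splits pass through."""
    return "train" if split == "restval" else split


summary = summarize_instance(sample)
print(summary)
print(training_split(summary["split"]))
```

In the example, `sentids` lists five caption ids while `sentences` holds a single caption whose `sentid` is one of them, so each record appears to pair an image with one of its captions. Code that needs all captions of an image would likely have to group records by `cocoid`.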
### Data Fields
[More Information Needed]
### Data Splits
[More Information Needed]
## Dataset Creation
### Curation Rationale
[More Information Needed]
### Source Data
#### Initial Data Collection and Normalization
[More Information Needed]
#### Who are the source language producers?
[More Information Needed]
### Annotations
#### Annotation process
[More Information Needed]
#### Who are the annotators?
[More Information Needed]
### Personal and Sensitive Information
[More Information Needed]
## Considerations for Using the Data
### Social Impact of Dataset
[More Information Needed]
### Discussion of Biases
[More Information Needed]
### Other Known Limitations
[More Information Needed]
## Additional Information
### Dataset Curators
[More Information Needed]
### Licensing Information
[More Information Needed]
### Citation Information
[More Information Needed]
### Contributions
Thanks to [@VictorSanh](https://github.com/VictorSanh) for adding this dataset.
Provider: maas
Created: 2025-08-01



