xdeng77/coconut_b
收藏Hugging Face2024-04-29 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/xdeng77/coconut_b
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc
dataset_info:
features:
- name: mask
dtype: image
- name: segments_info
struct:
- name: file_name
dtype: string
- name: image_id
dtype: int64
- name: segments_info
list:
- name: area
dtype: float64
- name: category_id
dtype: int64
- name: id
dtype: int64
- name: iscrowd
dtype: int64
- name: isthing
dtype: int64
- name: image_info
struct:
- name: coco_url
dtype: string
- name: date_captured
dtype: string
- name: file_name
dtype: string
- name: height
dtype: int64
- name: id
dtype: int64
- name: license
dtype: int64
- name: width
dtype: int64
splits:
- name: train
num_bytes: 1799961313.342
num_examples: 241602
download_size: 1641153199
dataset_size: 1799961313.342
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
## How to download
1. Set up environment
```
pip install datasets tqdm
wget https://raw.githubusercontent.com/bytedance/coconut_cvpr2024/main/download_coconut.py
```
2. Use the download script to download the COCONut dataset splits.
```
python download_coconut.py --split coconut_b # default split: relabeled_coco_val, need to switch to coconut_b
```
3. Download other COCONut dataset splits.
If you want to download the other splits, you can replace the split name to "relabeled_coco_val" or "coconut_s"
NOTE: multiple splits download is not yet supported.
```
python download_coconut.py --split relabeled_coco_val --output_dir relabeled_coco_val
```
4. The mask images are nearly black as we use continuous segment ids for each image, you can use [github visualization tutorial](https://github.com/bytedance/coconut_cvpr2024/blob/main/tutorials/visualization/demo.ipynb) to create colorful masks for viewing.
Please go to our offical github repo for detailed usage instruction: https://github.com/bytedance/coconut_cvpr2024
提供机构:
xdeng77
原始信息汇总
数据集概述
数据集特征
- mask: 图像类型数据。
- segments_info: 结构化数据,包含以下字段:
- file_name: 字符串类型。
- image_id: 整数类型。
- segments_info: 列表类型,包含:
- area: 浮点数类型。
- category_id: 整数类型。
- id: 整数类型。
- iscrowd: 整数类型。
- isthing: 整数类型。
- image_info: 结构化数据,包含以下字段:
- coco_url: 字符串类型。
- date_captured: 字符串类型。
- file_name: 字符串类型。
- height: 整数类型。
- id: 整数类型。
- license: 整数类型。
- width: 整数类型。
数据集分割
- train: 包含241,602个示例,数据大小为1,799,961,313.342字节。
数据集大小
- 下载大小: 1,641,153,199字节。
- 数据集大小: 1,799,961,313.342字节。
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*



